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A Theorem Concerning Noise in Electron Streams 


J. R. Pierce 
Bell Telephone Laboratories, Murray Hill, New Jersey 


(Received October 26, 1953) 


It is proposed that if Imax and 7mm are the maximum and minimum noise currents in a noise standing 
wave on an electron stream, then the product of their magnitudes cannot be smaller than 


kT. 
\Tmax| | Zmin| =aelB()(—* 
W 


eVos 


Various assumptions lead to a factor a ranging between 0.8 and 1. J» is beam current, B is band width, w is 
frequency, wg is effective plasma frequency, 7. is cathode temperature, and Vo is the voltage specifying the 


velocity of the electrons. 





T is proposed that if Jmax and Jin are the maximum 

and minimum noise currents in a noise standing 

wave on an electron stream, then the product of these 
cannot be smaller than 

w kT, 

| T nex | | Tmin| =ael »B— . 


Wa eVo 





Here w, is effective plasma frequency, J» is beam cur- 
rent, Vo is beam voltage, and T, is cathode temperature 
and a is a factor near to unity. Of course, the noise 
might be higher than this for reasons not considered 
here. 

Here linearity is assumed throughout and it is as- 
sumed that the noise current on the electron stream can 
be represented by a pair of unattenuated waves. Sup- 
pose two unattenuated modes of propagation are coupled 
by a lossless passive transducer. Suppose the modes 
carry power in opposite directions. Then if P, and Q. 
are the amplitudes of the modes at the input end, and 
if Pg and Q4 are the amplitudes of the modes at the 
output end,! 


Pa= (1+ )tei™P + kei, (1) 
Qa= (1+ k*)beiO a+ kei —Or+6) P,, (2) 


This is the most general possible relation between Pa, Qa 
and P,, Qa. allowed by the conservation of energy. 


1J. R. Pierce, J. Appl. Phys. 25, 179 (1954). 


Here it has been assumed that the power flows are 
given by 


PaP.*, —QdQa* 
PaPa*, —QdQa*. 
If we have instead power flows given by 
KaPoPa*, —KaQdQa* 
KaPaPi*, —KQ2Q4*, 


our equations become 
Pa(Ka)'= (1+#)*(Ka)tePo+h(Ka)te0, (3) 
Qu(Ka)'= (1+-#)*(Ka)¥e™Qat+k(Ka)bei*-*t% Py, (4) 


Let us now apply these relations to the case of space- 
charge waves, in the case of which P., Pa, Q. and Qa 
may be electron convection currents. It is assumed that 
the phase constants of the two waves can be written 
B= (w-Fw,)/uo. We can define an impedance K and a 
voltage V which exhibit the proper behavior at discon- 
tinuities in w,, the effective plasma frequency, and Vo, 
the beam voltage with respect to the cathode, and 
which satisfy the usual relations among voltage, current 
and power, 


Wa Vo 
K=+2— — (S) 
(é) To 
Uo 
V=—-—. (6) 
e/m 








932 cs. &, 


Here » is ac electron velocity and mp is average electron 
velocity. The upper sign applies for the fast space 
charge were and the lower sign for the slow space- 
charge wave. 

Let us consider the case of a standing wave in which 


Qa=Pa= 1 a (7) 


This corresponds to maximum current and zero voltage 
at the input. We have 


Ku? 
Pa= (—) e[ (1+ k?)'+ kei) 1 Pay (8) 


d 


. 


K.\? 
Qa= (=) eM (1+k) kei Par. (9) 


d 


We see that 
| Par| =|Qar*|. (10) 


That is, if there is a standing wave at the input there 
is a standing wave at the output. This is necessary, 
because a standing wave means no power flow, and if 
there is no power flow at the input there can be no 
power flow at the output. 

Let us now consider another standing wave at the 
input, for which 


Qe2= — Pas= — Prs/2. (11) 


This corresponds to zero current and maximum voltage 
at the input. We have 


Ka\! 
Pa Sy, eMC (14k)! kei") Pan (12) 


d 


Ka\} 
Quw=— (—) ell (14k)! kei) Poo, (13) 
= 


d 


As before, we have ~ 
| Pao| = |Qae]. (14) 


Suppose that P; and Py; [see (7) and (11) ] represent 
uncorrelated narrow-band noises. Then the resulting 
signals on the beam following the transducer will be 
of the form 


PartQa= li cos[}(8,—B,)2-¢1] — (15) 
PatQa=Ts cos 3(8,—B, fill $2]. (16) 


Here 8, and 6, are the phase constants of the P and Q 
modes following the transducer. At any z position the 
mean square amplitude will be the sum of the squares 
of the two signals of (15) and (16). It is easy to show 
that if Imax? and J min? are the maximum and minimum 
values of the sum of the squares of (15) and (16), then 


Tmax’! min? = TI? sin*(g2— ¢1). (17) 








PIERCE 


From (8), (9), (12), and (13), J1, Zs, 91, ¢2 can be 
evaluated and after a great deal of repulsive algebra we 


find 
K.\" 
T ax?! min? = (-*) PPP (18) 
Ka 
or 
T ax | I min =—| P| Pu!. (19) 
Ka 


We should note that with a suitable transducer it is 
possible to make |Jmax!=|Zmin|. This can be done 
through the use of space-charge-wave amplification and 
deamplification. Thus, if there is po noise standing wave 


| Twax|2= | Tmin|?=—| Pr} Pu}. (20) 


Ka 


Equations (19) and (20) seem to indicate that what 
might be called the “noisiness” of an electron beam 
depends entirely on the source of the electrons and not 
on the particular way in which the electrons are accel- 
erated; that is, for a beam of a given current, voltage 
and geometry, we have a choice only as to the noise 
standing-wave ratio, but |Jmax!|Zmin| will be a con- 
stant as long as the source of electrons is constant. 

Suppose that this is so. Consider a beam of electrons 
from a thermionic cathode at temperature 7.. We can 
compute |Jmax|? and |Jmin|? for the case of electrons 
accelerated quickly from a temperature-limited cathode. 
We obtain? . 

| Imax |?= 2eToB (21) 


_ w\'1 kT \? 
|Z min | ?= 2e1B( ~) ( —) ° (22) 
wat +\eVo 


Accordingly, we would expect in any case 


__ LF istal 1/w kT 
tele (=)(E). 
2el oB 2\w J \eVo 


Matters are not quite so simple, however, for we are 
making some sort of error in dealing with multi-velocity 
flow. For instance, consider matters at the cathode. 
There the fluctuations in current and velocity, J, and 2, 
are such that 


12=2¢I 0B (24) 


e kT. 
v= 4-n(=)/( )e. (25) 
To m 


Let subscripts 1 refer to cathode and quantities without 
subscripts 1 refer to beam. We have the single-valued- 
velocity relation of current to velocity 





“ 


w@ Io 
[=—- —, (26) 
Wqi U1 
2J. R. Pierce, Proc. Inst. Radio Engrs. 40, 1675-1680 (1952). 





al 


{ 
! 
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and then we obtain 
Wal Vor 
K,=2— 7a (27) _-_ | Zmin| 1 Ww kT, 
w Io a =0926(- — —). (36) 
2el oB 2 Wa eVo 
V 
Ka= 2! a (28) These examples merely show the magnitude of the error 
w I that may be involved, but do not exhaust possible 
choices of averages. We may say 
From (24)—(28) we conclude that 
rom (24)-(28) we conc \Tunax||Zmin| a w AT, G7) 
ry. at , — a= oo 3 
| mg) me 2 / ’ ‘a 
| T end wate es (-)|— (-") . (29) 2el B 2 Wa eVo 
2eloB 2\w, Uoi"m Vo 


How are we to interpret Vo; and %? We have the 
single-valued-velocity relation 





€ 
Uo? = 2—V 1. (30) 
m 
We have for the electrons leaving the cathode 
wkT. 
= (31) 
2m 
2kT. 
(167) m= — (32) 
m 
If we take 
e€ wkT, 
Uo." = 2—V 01 == f= ——. (33) 
m 2m 
then we obtain*® 
| T max! min| 1 w kT, 
eS (0.822)( -—- ‘). (34) 
2eIoB 2 Wa eVo 
If we take 
e 2kT.. 
Uo." = —)| = (0?) y= ——, (35) 
m m 


3 This same factor can be obtained following Dean A. Watkins 
Noise Reduction in Beam Type Amplifiers, Electronics Re- 
search Laboratory, Stanford University, California, Technical 
Report No. 31; and by using Llewellyn and Peterson’s equations 
to relate noise velocity and current at the potential minimum to 
noise in the beam. 


where a is a factor near to unity. 

It is interesting to consider also the maxima and 
minima of the ac velocity fluctuations, t,x and Umin. 
For a standing wave, the ratio of v to J is given by (5) 
and (6) ; the velocity corresponding to the current maxi- 
mum is found at the current minimum and vice versa. 
From (5), (6), (25), and (29) we obtain 


_— | Umin| - (“)( 2eV o1 ) 
ve w (4—r)kT, 


2(4—m)kT.\ 
x(——). (38) 


Uoi-m 





Using (29) and (36) we can write this 


oe 


! ! 
| Umax | | Umin | 2a Wa 


Ve" (4— - w 
(39) 


Either slightly different noise currents and velocities 
should be obtained in different physical situations, or 
else there is some proper way of using averages which 
makes a unity. 

Strictly, the foregoing relations apply only when 
there is not enough space charge to cause a potential 
minimum in front of a cathode. They will apply in the 
case of a potential minimum only if the noise in the 
convection current passing the potential minimum is 
shot noise. The writer is inclined to believe that in the 
centimeter range this is the case, and that the relations 
apply for space-charge-limited flow. 
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The Transfer Function as a Tool in the Analysis of a Resistance-Capacitance 
Coupled Voltage Amplifier 


D. L. Minc anp R. W. THorPe 
Aerojet Engineering Cor poration, Azusa, California 
(Received July 13, 1953) 


The frequency response of a typical resistance-capacitance coupled voltage amplifier has been theoreti- 
cally analyzed and the result compared with that given by Terman. The advantages and disadvantages of 
the two methods have been illustrated both mathematically and graphically. A solution to the phase-shift 
problem at any frequency considered has been included. 





I, INTRODUCTION 


A THEORETICAL analysis was made on the fre- 
quency response of a typical resistance-capaci- 
tance coupled amplifier and the result was compared 
with that of the most common analytical method ad- 
vanced by Terman.' For an amplifier of this type, the 
response curve rises and falls at the same rate for the 
respective low- and high-frequency signals. 

In Terman’s method, the total equivalent circuit of 
the system breaks down into three separate circuits: 


A high-band, a mid-band, and a low-band circuit. The 


gain points of the system are computed from the 
applicable equivalent circuits, and the over-all response 
curve is plotted using a set of universal curves. Although 
this method is based on several approximations, it has 
been shown to be valid for amplifiers of standard design. 
For an uncommon design, involving extremely high- or 
low-value components or unusual ratios between com- 
ponent values, Terman’s method may not hold. More- 
over, it offers no convenient way of determining the 
phase shift, since the mid-band equivalent circuit com- 
pletely eliminates all reactances, and the 180° phase 
shift indicated occurs only at one particular frequency. 


6AVE 











+300 
(9) SCHEMATIC DIAGRAM 
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(b) EQUIVALENT CIRCUIT 


Fic. 1. Circuit diagram of a typical resistance-capacitance 
coupled voltage amplifier. 


'F. E. Terman, Radio Engineering (McGraw-Hill Book Com- 
pany, Inc., 1947), third edition. 


The theoretical treatment herein presents a method 
which provides an accurate solution at any frequency 
considered, and gives complete phase-shift information. 
However, this method is not intended to replace other 
methods now in use. It is offered merely as an additional 
tool for precise determination of phase shift or accurate 
calculation of gain. 


II. DISCUSSION 


Based on the present analytical method, the sche- 
matic circuit diagram of a typical resistance-capacitance 
coupled voltage amplifier using the triode section of a 
6AV6 receiving tube is presented in Fig. 1 (a), with the 
total equivalent circuit shown in (b). It will be noted 
in (b) that both the bias voltages and the power supply 
voltages are eliminated in this system, but the signal 
circuit is retained. The values of the circuit components, 
however, are typical for this tube in either case. 

The voltage, current, and impedance relationships, 
represented in matrix form and properly applied, give 


Ce 
“tT —o 





AAA 
VY 
AA 
VY 
es) 
oe 
it 
W 
o 




















VWYV 
AMA. 
wWVV 
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AAAA. 
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Fic. 4. High-band equivalent circuit. 
c 
f, c 
on r ° 
> 

3 rR $ R, e& 

ee 














Fic. 5. Low-band equivalent circuit. 


Fics. 2-5. Terman’s method as applied to the same circuit. 
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rise to an expression for the gain of the stage, expressed 
as a function of frequency. This function will be of 
the form 


€o —AS 


—— ’ (1) 
es (T)S+1)(T2S+ 1) 





where the quantities are defined under Definition of 
Symbols. 

Equation (1) yields complete information on ampli- 
tude and phase with respect to frequency. In using the 
quadratic formula for finding the factors of the de- 
nominator, (7;S+1) and (J.S+1), extreme care must 
be exercised to avoid the introduction‘ of appreciable 
errors. In the case of the present example, it was found 
preferable to use a trial-and-error method, as shown in 
the calculations. Since slide-rule accuracy is not suffi- 
cient, logarithms were used in computing the answer. 

For purposes of comparison, the same circuit was 
analyzed using Terman’s method, and the results, ex- 
pressed in db, were plotted to the same graphical scales 
as before (Figs. 2-5). The two gain curves differed by a 
maximum of one db, with the —3 db points occurring at 
substantially the same frequencies. Phase information, of 
course, was not available from Terman’s method. The 
equivalent circuit shown in Fig. 1(b) may be repre- 
sented as that given in Fig. 6. 











‘Soe 





























% 
. Re /CeS 
es Ret a3 
| | | 
Fic. 6. Modified total equivalent circuit. 
Let 
Ti = R uC p 
and 
T2=R¢Ce 
then, 
R L/C pS R L ( 
= 2) 
Rr+(1/CeS) (T:S+1) 
and 
Re/Ce¢S Re 
(3) 





Rot(1/CoS) (T2S-+1) 


Then, in matrix form, 


R R, 
qualtes - apaitee - 
+ Tei 7, Si I. Hes 


R R 7 » 4) 
Tse [tsb TSH + FSi , ° 




































































where 
Rr 
me 
T,S+1 
I,= ’ 
Rt 1 Rt Re Rt P 
ae eee 
TyS+14LC.5S T,S+1 TS+1 T,S+1 
Re 
€o=1e 
TS+1 
Then 
RiRe¢ 
i 
€0 (T,S+1)(T2S+1) (5) 
es e r RL r Re Ri 7 RiRe 
ami ol dl + sspeni 
CS TS+1 TS+1 CS(TS+1) (T:S+1)(TS+1) 
and, expanding, rearranging, and collecting the terms 
RiRe 
pte 
€0 Ritr, 
as (6) 








es feliT2 fpRiC-T2 tpReC-T) Trl, 
s| + + +S 
tot+Ry fotRr rpt+Rr 


rpT2 tpRuCe tpRoCe Rx: oy , 
1 1 eo | -. +1 


as a e a 7 bs 
rot Ri, rR, fot Rr rot Rt Tot Rt rot Rt 








936 ms Bs 








MING AND R. W. THORPE 
Assign the following values: 
T,:=RiCp, , 
T2=R¢Ce, K.= _ 
rot Rit Ks5= K2K;, 
T;=R14C., 
K3,=fp (K4+ K;=1). 
T,=RcC., 
Ky= Ki Kz, 
K,=R1, 
By substitution: 
€o —pAKyT,S 


FO: < Se (7) 


e, A SUK s(11T2+ 7273+ T:T,) ]+SC T2+Ts+K3(T,+T3) J+ 1 


Equation (7) is the general expression for the transfer 
function of the circuit shown in Fig. 6. 

For the example presented here, the following values 
were chosen for the components: 


p= 100, C.=0.006X 10-*, 
rp=0.11X10°, Rg=0.47X 10, 
Cp=0.8X10-", Ce=Cext+Cep(1+A)=140X 10-", 


R,=0.47 X 10°, 


where the resistances are given in ohms and the capaci- 
tances in farads. Then, numerically the quantities are 


T,=0.376X10-®, K.=1.722XK 10, 


‘»=65.9X10-*, K;=0.11X108, 
T;=2820X10-*, K,=0.8103, 
T,=2820X10-*, K;=0.1897. 


K,=0.47X 10°, 
Substituting these values in Eq. (7), one obtains 
€o —0.228S 


e, S0.03538X 10-*]+S[ 3421 10-*]4+41 
This expression is of the form 
€0 —AS —AS 
@ BCS+(B+C)S+1 (BS+1)(CS+1) 

where BC =0.03538 X 10-*= 35 380 10-” and 


B+C=3421X10-*. 








(8) 


A trial-and-error method of simultaneous solution 
using logarithms yields 





B=3411X10-6 
and 
C=10.38X 10-6, 
eo —0.2285 
e, (3411X10-®S+1) (10.38 10-6S+ 1) 
—0.228S 





; : 
——_+1) 
96 340 


s 
(Ss) 
293 





This latter expression may be represented as a single 
vector, having an absolute amplitude | R|, and a phase 
angle of ¢. The amplitude expressed in db may be found 
as shown in Eq. (9). 


9a ; ae Te oe 
—_ i253) na (seed ")? 


' 2 3 + 


IRi(db)= 20L 0g 0.228 + 20Logw + 20L0 + 20L0g 





(9) 


where 


(1) = 20 log (0.228) = — 13.84, 
(2) = 20 logw, 
slope= +20 db/decade, through R=0, w= 1, 
(3)=0, when w<1, 
= 20 log(1/w), when w>1. 


The breakpoint occurs at 293 radians/second, and 
the slope is equal to — 20 db/decade. 


(4)=0, when w<1, 
= 20 log(1/w), when w>1. 


The breakpoint occurs at 96 340 radians/second, and 
the slope is equal to — 20 db/decade. 

From this information, the curve of gain (|R|) vs 
frequency may be plotted as shown in Fig. 7. 

The phase shift may be determined as shown in 
Eq. (10). Thus, 





= —90° —Ton (34x 10-8 w) — Ton” (10.38 x 10-%w) 
Lt v Ju ~ J (10) 
e >, %, 


where the values of the quantities are as tabulated in 
Table I. From this information, the curve of the phase 
shift vs frequency may be plotted as shown in Fig. 7. 


(ReR1/Ret+R1) 
Mid-band gain =u 
ReRt 


Ret+Ri 





fot 





= 68.1. 


Expressed in db, gain= 36.7. 





Fi 


wl 
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LJ — RADIANS / SECOND 


Fic. 7. Frequency response curve from transfer function method 


The low-frequency half-power point occurs when 


Rif» 
Xee= Re+———= 0.5591 X 108 
R L+Tp 
which is at frequency 
10° 





= 47.5 cps. 


J+J (0.006) (0.5591 108) 
= 298 radians/second. 
The high-frequency half-power point occurs when 
“— Re¢Rir> 
© ReRitRirytrRe 





=0.075X 10° 
which is at frequency 


10" 
fr= —=15 170 cps 
(0.075 X 10°) 27 (140) 





=95 400 radians/ second. 











TABLE I 

w or. o2 $3 > 
1 —90 0 0 = J 
10 | 2 0 — 92 
10 19 0 — 109 
10% | 73 1 — 164 
2X 10° 82 1 —173 
10* 88 6 — 184 
3X 104 89 17 — 196 
105 90 45 —225 
106 90 85 — 265 
10? 90 89 — 269 








«0 





to* 0” 


UJ = RADIANS /SECOND 


Fic. 8. Frequency response curve from Terman’s method. 


The two half-power (— 3 db) points and the mid-band 
gain being known, Terman’s universal curves can be 
used to plot the frequency response as shown in Fig. 8. 


DEFINITIONS OF SYMBOLS 
C.= Blocking capacitor. 
Cex=Capacitance between the grid and cathode of 
following stage. 
Cqgp= Capacitance between the grid and plate of follow- 
ing stage. 
Cx=Cathode bypass capacitor. 
Cp=Output capacity of tube. 
I,=Loop current as indicated in Fig. 6. 
I,=Loop current as indicated in Fig. 6. 
K,=A constant, = R_, 
K.=A constant, =1/(r,+Rz). 
K;=A constant, =7p. 
‘s=A constant, = K,K2=R_z/(r,+R1z). 
K;=A constant, = K2K3=r,/(r,+R1z). 
|R| = The absolute amplitude of the transfer function, 
without regard to phase angle. 
R«= Grid leak of following stage. 
Rx=Cathode bias resistor. 
R= Plate load resistor. 
S=Operator (jw). 
T,= Time constant of R, and C>. 
T.= Time constant of Rg and Cg. 
T;= Time constant of C, and Rr. 
T,=Time constant of C, and R,. 
eo= Output voltage. 
e;= Signal (input) voltage. 
{1=Low frequency half-power point. 
fo=High frequency half-power point. 
r,=ac Plate resistance of tube. 
u=Amplification factor of tube. 
¢o= Phase angle, 
w= Frequency in radians per second. 
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In the first part the discontinuity conditions in an arbitrary continuous material are deduced for a general 
(i.e., possibly curved) discontinuity surface. It is then shown that only three types of discontinuities are 
possible—shocks, contact discontinuities, and phase-change fronts. In the second part the acoustic dis- 
continuity conditions are deduced and specialized to a perfect fluid without heat conduction. Then a first- 
order partial differential equation is obtained for the location of an acoustic shock front. This equation can 
be solved, as in optics, by means of rays. The variation of shock strength along a ray is then determined 
(this is one “es result of this paper). Coefficients of reflection and transmission for an acoustic shock at a 


contact disc 
of the shock tube. 


ntinuity in the basic flow are also obtained. Finally, the results are exemplified by an analysis 





I. INTRODUCTION 


NE of the main difficulties in the theoretical an- 

alysis of shock wave problems is that the shock 
front and the flow behind it must be determined simul- 
taneously, since they affect each other. However, for 
very weak (acoustic) shocks the interaction is so slight 
that the motion of the shock front and the variation in 
its strength can be determined independently of the 
rest of the flow. This fact is deduced in the present 
report, and the main consequences of it are also de- 
termined. The result is that weak shocks can be an- 
alyzed by the methods of geometrical optics, and for 
this reason the theory of weak shocks may be called 
geometrical acoustics. Another acoustical phenomenon 
—the propagation of periodic sound waves of high 
frequency—can also be analyzed by the methods of 
geometrical acoustics, and this analysis will be pre- 
sented in another report. 

Although it has often been noticed that certain shock 
wave phenomena could be described in optical terms, 
there does not seem té be any complete derivation of 
geometrical acoustics in the literature. The present 
derivation begins with a deduction of the discontinuity 
conditions for curved discontinuities in any continuous 
medium, and continues with the analysis of the possible 
discontinuities in arbitrary media. Then the acoustic 
equations are deduced as the variational equations of 
the nonlinear fluid dynamic equations. In the same way 
the acoustic discontinuity conditions are obtained, and 
it is shown how the geometrical optics, considerations 
of wave fronts, rays, etc., apply to the determination of 
shock waves in perfect fluids. Finally, as one of the 
main results, the variation of shock strength along a ray 
is determined. This result is extended to the determina- 
tion of the reflection and transmission coefficients for 
shocks incident on contact discontinuities (boundaries). 

For the most effective use of the present theory it is 
important to take account of diffracted rays and wave- 

* A revision of notes on lectures on Mechanics of Continuous 
Media delivered at New York University in the fall of 1949 and 
at the University of California (Berkeley) in June, 1951. 

¢ This report represents results obtained at the Institute for 


Mathematics and Mechanics, New York University, under the 
auspices of Contract No. nr-285(02). 


fronts, which can be done by slightly refining the meth- 
ods of ordinary geometrical optics. An explanation of 
this refinement will be presented in a future report, 
although use has already been made of it.’ 

The present analysis is patterned after that of R. K. 
Luneburg’s study of the propagation of discontinuities 
in electromagnetic theory.” 


II. DISCONTINUITY CONDITIONS IN CONTINUUM 
MECHANICS 


A. Fundamental Equations 


The fundamental equations of continuum mechanics, 
in their integral form, are 


d 
— f sav+ f o(u—vasi=0, Mass (1) 
dt Jy Ss 


d 
— f pudV = f pfidV 
dt Vv v 


+ f CP s—eus(u—2) Ms, Linear momentum (2) 
s 


d 
— f o(etiueydv= f ofandv 
dt ¥y Vv 


+ f [7 ;:—qj—o(+4u?)(uj—)) WS, Energy (3) 
Ss 
ad 


— JF p(Xinitigo—Xipettiga)dV 
dt Jy 


[ J P(Xittfite—Xipefin)dV 
y 


+f [xigi T i+2, j— Wipe] igs, 5 
Ss 


— p(Xi4Mipe— Xi4 241) (Uj—0,) dS ;, 


Angular momentum (4) 
1 See, for example, Albert Blank and Joseph B. Keller, Comm. 
Pure Appl. Math., Vol. IV, No. 2 (1951). 
2 See, for example, R. K. Luneburg, “Mathematical theory of 
optics,” Brown University (1944); “Propagation of electromag- 
netic waves,” New York University (1949). 
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© f psav— f [99-+0s(u—2) 5.20, 

i . Second law (5) 
e=e(p,s), First law (6) 
0=€,. (7) 


In these equations S(¢) is the surface of an arbitrary 
volume V(t), x; are Cartesian coordinates, i takes the 
values 1, 2, 3 (mod 3), p is density, 1; is particle velocity, 
s is entropy per unit volume, « is internal energy, /; is 
external force (both per unit mass), 7;; is the stress 
tensor, g; is the heat flow vector, and @ is temperature. 
The summation convention is understood and all quan- 
tities are functions of x;, t. If the equation of S(t) is 


$(x;,t)=0, (8) 
then the unit normal v; has components 
vi=dri(pr;?)*. (9) 
The velocity 7; of the surface is then defined by 
v= —vidi(r;)—*. (10) 


It is assumed that the normal v; points out of V, and 
that dS;=v,dS where dS is an element of area of S. 

The preceding equations can be transformed into 
other forms by means of the two transformation 
theorems 


ditJy 


[ @ozav= f Bas. (12) 
v s 


For the validity of the first theorem it is necessary 
that A be continuous, and we will have to make use of 
this fact. 

Applying (11) to the left side of (1) yields 


foav f ooas.t f o(u—v)asi=0. (13) 
Vv Ss s 


Now making use of (12) in (13) we find 


d 
— adv=f aav+ f Av dS;, (11) 
Vv Ss 


f Cort (pu,)x; dV =0. (14) 


Since (14) holds for an arbitrary volume V, we have 
when the integrand is continuous 


pit (pu;)2;=0. (15) 
Proceeding similarly with (2)—(5) yields 
Uiyetu;(u,)2zj—p (7 ;;)2;= Sis (16) 


etuseritp'L(qi)4i— T ;;(u,)2;]=9, (17) 
T,;= T 55 (18) 
p(Situiszi)+ (¢8)2:>0. (19) 


Equations (15)—(19) are the differential forms of the 
fundamental equations of continuum mechanics, and 
apply only when all the functions p, u;, etc., are con- 
tinuous. 

If any of the functions p, “;, etc., has a jump dis- 
continuity across a surface S’, the above differential 
equations do not apply on S’. In this case we choose a 
region V which is divided into two parts, V; and V2, 
by S’. The boundary of V; consists of S’ and another 
part which we call S,; similarly, V2 is bounded by S’ 
and S». Since S’ is a discontinuity surface, we indicate 
the limits which functions attain on approaching S’ 
by a subscript one or two, according as S’ is approached 
through V; or V2. We now apply (1) to each of the 
regions V, V;, and V2 and then subtract the equations 
for V; and V2 from that for V. We thus obtain 


-f [o(ui—2) hdSa’— f [p(ui—v,) Jodie’ =0. (20) 
Ss’ Ss’ 


Combining these integrals, and noting that dS,’ 
= —dSj', we obtain 


f {[p(u:—v;) i—[[(u;—0,) Jo} dS 52’ =0. (21) 


Since S’ is an arbitrary part of the discontinuity sur- 
face, we have when the integrand in (21) is continuous, 


Lo(ui—v;) }ivi=Lo(ui—2;) Jovi. (22) 
In eactly the same way, (2), (3), (5) lead to 
LT i;—pui(uj—v,) ]iwj=LTi;—pus(uj—v;) Jov;, (23) 


[7 ju:—qj—p (e+ 3u?) (uj—v,) iv; 
=(T7jj;ui—qj;—p(e+}3u2)(uj—0;) Jov;, (24) 
(gO'+ ps (u;—v;) ]ivi>[q.8-'+ ps (us—2;) Jovi. (25) 


In deriving (25) it is assumed that »; points toward side 
one; if not, the inequality must be reversed. The equa- 
tion obtained in the above manner from (4) is an im- 
mediate consequence of (24). Thus, the discontinuity 
conditions implied by (1)-(5) are (22)-(25). These 
conditions, together with the differential relations (15)- 
(19), also imply the integral Eqs. (1)-(5). (The 
derivation of (25) requires a proof that the left side of 
(5) computed for V is greater than the sum of the same 
expressions computed for V; and Ve. This follows di- 
rectly from (19).) 


B. Shocks, Contact Discontinuities, and Phase 
Change Fronts 


The jump conditions (22)—(25) may be rewritten in 
the form 
piU;=pU 2 or p2v,;Au;= saa U,Ap, (26) 


piU ,Au;= vj;AT;;, (27) 
pU A(e+ 5?) — vjA(T u;—q;) =9, (28) 


piU,As+ viA(qd)>0. (29) 








Here we have introduced the normal velocities U;, U2 
of the surface relative to the medium on sides one and 
two, respectively, and the “jump” A, by 


Ag=g1— 82, (30) 
U.= (ti ti) 3, (31) 
Ue= (v;— Uj2)v;. (32) 


If U,=0 the surface moves with the medium and the 
discontinuity is called a contact discontinuity. In this 
case, since p; and p» are not zero, (26)—(29) become 


U.=0 or vAu;=0, (33) 
vj;AT;;=0, (34) 
vj Aq j= v;T i jAni, (35) 
viA(ge")>90. (36) 


Equations (33), (34) show that the normal velocity 
and the stress on the surface are continuous, while (35) 
shows that the jump in normal heat flux equals the 
jump in power of the normal stress. 

If U0, (27) may be written 


Au;= (p,U 1); AT;,;. (37) 


If in addition, Ap+0, the discontinuity is called a shock. 
From (37), (26) we find U,, the normal velocity of the 
shock relative to the medium on side one: 


—pwvjATj;\3 
Us (——*) , (38) 


pidp 


From (38), (26) we find that U is given by (38) with 
subscripts one and two interchanged. 

If U,40 and Ap=0, the surface is called a phase 
change front. Equation (37) applies, while (26), (28) 
yield 

vjAu;=0, (39) 


vjA(u;T ;;—- :) 
— j i - . (40) 
pid(e+ }u,*) 





Equation (39) shows that the normal velocity is con- 
tinuous, while (40) gives the normal velocity U; of the 
surface relative to the fluid, which is also equal to U2. 
Such fronts occur in melting, freezing, evaporation, and 
condensation, etc. 


C. Perfect Fluids 
A perfect (or nonviscous) fluid is defined by 





T i;= — pdi;. (41) 
Here p is the thermodynamic pressure, defined by 
de 
p= = pe. (42) 
d(p) 
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For incompressible media (42) is meaningless, and then 
p is intreduced as a new dependent function. 

For a perfect fluid (18) is automatically satisfied 
while (16), (17), (19) can be reduced to 


Mie; (U,)ej+ppri= fi, (43) 
1 

Sit UjS2;= ——(qi)zi, (44) 
po 

gr; <0. (45) 


In deducing (45), the assumption that 6>0 is made 
use of. 


At a contact discontinuity (34)—(36) become 


Ap=0, (46) 
vjAq;=9, (47) 
giv: A(O)>6. (48) 


Thus, p and the normal heat flux are continuous, while 
heat flows from higher to lower temperature at a con- 
tact discontinuity. 

At a shock (37) and (38) become 





Au;= —_ vi(piU1)“"Ap, (49) 
pAp\} 
Uys +( 4 (50) 
piip 


At a phase change front (49) applies, and this with 
(39) implies that #; and p are continuous. Then (40) 
becomes 

— vj Aq 
U»=——. (51) 





pide 


If no heat conduction occurs, then a phase change front 
cannot occur in a perfect fluid, since (51) would imply 
Ae=0, and then all quantities would be continuous. 
(U, cannot vanish, by definition of a phase change 
front.) 


Ill. GEOMETRICAL ACOUSTICS—WEAK SHOCKS 


A. The Variational or Acoustic Equations 


Let us consider a one-parameter set of solutions of 
the equations of continuum mechanics depending upon 
a parameter 7. We assume the solutions are differen- 
tiable with respect to 7 at n=0 and we differentiate (6), 
(7), (22)-(24), (42) with respect to 7 at »=0, obtaining 


pit (pu ;+ pit;)2i=0, (1) 
ee p 
Wiyt u(t ‘)ajt u;(%;)rj— (7 )z3+—(Tij)25= Sis (2) 
1 p oe 
ebitieritmérst+—[ (qi2i— 7 ij(ui)2j— T(t) 2;] 
p 


a —Lighe- T;(ui)zi;]=0, (3) 
2 
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Ti5=T ij, (4) 
é=e,f+e,8, (5) 
B= pobt pss, (6) 
6=0,p+0.8. (7) 


In these equations a dot denotes differentiation with 
respect to 7. These equations are linear in the deriva- 
tives p, 8, u;, etc., which are called the variations of the 
corresponding undifferentiated quantities, or the acous- 
tic density, entropy, velocity, etc. The equations are 
called the variational or acoustic equations, and the 
undifferentiated quantities are called the basic flow. 
It is to be noted that a solution of the homogeneous 
variational equations (with f;=0) is given by the x; 
derivative of a solution of the original equations (with 
fi = 0) ° 

In order to differentiate the discontinuity conditions, 
we must take account of the fact that the discontinuity 
surface may depend upon 7. Thus, from II-(8) we have 


Since only normal displacements of the surface need 
be considered, we may satisfy (8) by setting 


“;=- vib(oz;7)-3. (9) 
Making use of (9), we obtain from (22)—(25) 
[ pov;Au;+ U Ap |" 


—[pov;Au;+ U Ap Jrjvj6(o2:7)*=0, (10) 
[piU Au;— vjATj;] 
—[p,U,Au;— vjAT ;; \rivid (ri?) *=0, (11) 


[oU A (e+}u?)—A(Tijui— 5) v5} — LeU A (e+ 30) 
— A(T; ui— qj) vj |revib (G22) *=0. (12) 


The inequalities yield no definite results when differen- 
tiated unless the equalities hold for »=0. Then they 
also hold for the derivatives, unless only positive 7 are 
considered, in which case the inequalities hold for the 
derivatives. Equations (10)-(12) are linear in the 
acoustic quantities, and may be called the acoustic 
discontinuity conditions. 


B. Acoustic Discontinuities 


When the basic flow is continuous (i.e., p:=pe, 
U,=Uz, etc.), (10)—(12) simplify to 


pv;Au;= — UAg, (13) 
pUAu;= vjAT i;, (14) 
pUA(e+ U;*1;) = vjA(T uit Ti i—- qi). (15) 


By using (14), we may simplify (15) to 
pUAe= vj(T;;Au;— Aq;). 


(16) 
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If U=0, we have an acoustic contact discontinuity, 
and (13), (14), and (16) become 


v,Au;=0, (17) 
yjAT ;;= 0, (18) 
v;Aq;=0. (19) 


If U¥0 but Ap=0, we have an acoustic phase change 
front, and (13) becomes (17), while (14) and (16) 
become . 

Au;= (pU);AT;;, (20) 


U= (pAe)—'v;(T;;Atu;— Agq;). (21) 


If U¥0, Ap+0, we have an acoustic shock, and (14) 
becomes (18), which, with (13) yields 


—vv,AT;; 4 
ina ey 
Ap 


In a perfect fluid the following further simplifications 
result: 


(22) 


Ap=0 contact discontinuity, (23) 
Au;=—(pU)-'v;Ap — shock or phase front, (24) 
U=+ (Ap/Agp)! shock, (25) 

U = —(pAe)—'v;Aqg; phase front, (26) 
pU@As= — vjAq; any discontinuity. (27) 


The last equation follows when II-(7), II-(42), and 
III-(5) are used in (16). 

If we now consider a perfect fluid without heat flow 
(q,=0), then phase fronts become impossible, and at a 
shock (27) yields 


As=0. (28) 


Using (28) in (5) yields Ap=p,Ap, and thus from (25) 
we have for the velocity of an acoustic shock 


U=+(p,)'=+c. (29) 


The quantity c=(p,)! is called the sound speed, and 
it may vary with position and time since it depends 
upon the basic flow. 

If the definition of U in II-(31) or II-(32) and II-(9), 
II-(10), are used in (29), we find. 


(b+ j2;)"=C(oz;7). (30) 


This is a first-order partial differential equation which 
must be satisfied by an acoustic shock in a perfect, 
nonconducting fluid. The coefficients depend upon the 
basic flow. 

If ¢:~0, we may write the equation ¢=0 in the 
form 


t=W(x;,). (31) 
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Then (30) becomes an equation for W: 
(uj;W2j;—1)?=CW2;?. (32) 


If «;=0, (32) becomes the eiconal equation 


1 
—= (W2z;)?= We?+Wee+ We. (33) 


C 


If u;=constant, (32) can be reduced to (33) by making 
use of the change of variable #;= x;— ut. Equations (32) 
and (33) apply only when c and u; are independent of ¢, 
which we will henceforth assume to be the case. 


C. Wave Fronts and Rays 


The acoustic shock surfaces W=constant will now 
be called wave fronts because the integration of equa- 
tions such as (32) has been studied in geometrical 
optics where that term is used. There is, of course, a 
complete theory of such equations, and we will merely 
specialize it to this case. 

First we define the wave normal p; by 


Pi = Wx. (34) 
From (32), (34), p; satisfies the quadratic equation 
(uipi—1)?=Cp?. (35) 


At each point in space (35) defines a surface containing 
the endpoints of the possible wave normals p;. This is 
called the surface of wave normals. The reciprocal of the 
radius from the origin to this surface in any direction is 
the velocity »v in that direction of a possible wavefront, 
since by I-(10), I-(31), I-(32) 


v= (p?)-+. (36) 


By examining (35) one finds that the wave normal 
surface is a surface of revolution about the direction 1;. 
Its section by a plane containing this direction is a 
hyperbola, parabola, or ellipse, according as the basic 
flow is supersonic, sonic, or subsonic (i.e., according as 
u=(u?)'=c). If u;=0, the wave normal surface is a 
sphere of radius c~. If 6 denotes the angle between p; 
and #;,- then in the subsonic case there is one wave 
normal for each value of #, and the wave-front velocity 
v=c-+u cos#. The same applies to the sonic case, except 
that there is no wave normal for @=7. 

In the supersonic case there are two solutions ; for 
each direction if0<@<cos~(c/u), and then »= u cosé-c, 
the upper sign applying to that branch of the surface 
nearer the origin. When cos~'(c/u)<@<cos-(—c/u), 
there is one solution ~; for each direction, and »=c 
+4 cosé. If cos~'(c/u)<@ there is no solution. These 
results are readily understood in terms of vectorial 
addition of u; and 2;. 

The Hamiltonian function H(x;,p;) is defined by 


H(x:,pi)=Cp?— (1—ujp;)’. (37) 
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In terms of this, certain curves x,;(0), which we call 
rays, are introduced by the equations 


dx; 


——=)H»p;= 2Lepi+ uic(p;*)* }. (38) 


da 


Here X is an arbitrary nonzero factor. From the last 
expression in (38) we see that the rays are orthogonal 
trajectories of the wave fronts if and only if #; is zero or 
parallel to p;, since then dx;/do is parallel to p;, which 
is a wave normal. 

From the general theory of first-order equations it 
follows that the rays satisfy -Hamilton’s equations, 
Lagrange’s equations, Fermat’s principle, etc. Further- 
more, by means of the rays we can construct the solu- 
tion of (32) corresponding to any given initial data. 
The wave fronts can also be found by Huygen’s 
construction. The vector dx;/do also lies on Fresnel’s 
ray surface, which is reciprocal to the wave normal 
surface. 


D. Variation of Shock Strength Along a Ray 


To determine the variation of shock strength (i.e., 
Ap, Ap, etc.) along a ray, we must consider the acoustic 
equations for a perfect, nonconducting fluid. By special- 
izing (1)-(3), (6), or by differentiating I-(43), I-(44), 
we obtain 


Prt (pu;+ pii;)x;=90, (39) 
Uipt+ ti; (uy)2j+ u;(t j2jtp pri pp pzi= fi, (40) 
StU jSzj+U82;=0, (41) 


p _ pst Pob- (42) 


We now consider the quantities w;(*;,W («;))=a;(x,), 
p(x;,W (x;)) = p(x,), §(x;,W (x,;)) =38(x;), and p(x;,W(x;)) 
=p(x;). These are the values of w;, 6, 8, and p on a 
wave front, and there are two such functions which we 
define in this way, one corresponding to each side of the 
wave front. The derivatives of either of these functions 
are given by 


prj=prjtpWz;, (43) 


§2j=82j5+8Wo;. 


Combining (39)—(42) with (43) yields 
. pL 1—uWe]+ui[—pWz; | 
= —p(a,)zi— pri;— p(ui)zi—prit,, (44) 
6 — Po Wei] +t 1—uWej]+8, — po Wei] 
= fi— ppp pei— pap Bai + pp; 
— (u;)2ij—Uu;(G;)2;—p (pp) zip—p'(pp)2i8, (45) 
§[1—uj;Wej ]= —djs2j;—Ujdz;. (46) 
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One such set of equations is obtained on each side of 
the wave front. We now subtract each equation on side 
two from the corresponding equation on side one. Then, 
remembering that the basic flow was assumed to be 
continuous, and that As= As=0, we obtain 


[1—u We; JA(6)+L—pW2 JA (air) 
= — pA(iti,)r;— pr, Ati;— (u;)r,Ap—ujApr;, (47) 
C— pop Ws; JA(6) 
$+[1—uW 25] Wiis) + pp Was] 6) 
=[p*pri—p (pp) 21 JAP— ppp Apri 
— (u;)2jAtj;—uj;A(U,)z;, (48) 
paella Nitin inci (49) 


Equations (47)—(49) are a set of five linear inhomo- 
geneous equations for the five quantities Ap,, Avi, and 
As,. Let us compare these equations with the shock 
conditions which are a set of five homogeneous equa- 
tions for Ag, Av;, and As. These latter equations can be 
rewritten in the form 


UApt+ pv;Au;=0, (50) 
vip, Ap-+pUAt;+ vip, As=0, (51) 
pUe,As=0. (52) 


If (47) is multiplied by 1/(W,2)!, (48) by p/(W.2)!, 
and (49) by pe,/(W,2)! then the coefficient matrix on 
the left side becomes identical with that of (50)—(52). 
Therefore, (47)—(49) can have a solution if and only 
if the right sides of these equations satisfy a certain 
orthogonality condition, since by assumption the 
homogeneous equations have a nonzero solution. To 
obtain this condition we eliminate As, from (48) by 
means of (49), and similarly eliminate As from (51) 
by means of (52). The resulting sets of four equations 
in four unknowns will both have the same coefficient 
matrix, which becomes symmetric if we multiply (47) 
and (50) by p, and (48), (51) by p. Now the solvability 
condition for (47), (50) is that the right side be orthog- 
onal to the solution of the transposed homogeneous 
equations, which are just (50), (51) because of the sym- 
metry. Thus we obtain the condition 


— Pp, {p(Ad,)2;+ pxri:Ati;+ (u;)2,Apt+ujApz; } Ap 
+p? {Lp pzi— ppp) xi JAP— ppp Ape: 
= (u;)2j;Au;— u;(Au;)z; } Au; 


+ Upp, szj;Atijv,Au;=0. (53) 


Equation (53) can be considerably simplified and 
solved explicitly. First, we note that Ad;=Awu; and 
Ap= Ap. Then we eliminate Az; in terms of Ap by (24), 
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and we eliminate Ap by the relation Ag=cAg, which 
follows from (5) and (28). In this way we obtain from 
(53) an equation in which the only acoustic quantity 
appearing is Ap. This equation is, after dividing by Ap, 


Ap Ap Ap u;(Ap)zi 2u;Apez; 
(=) + (%)z:—— (u;)2;—— + 
U Ey U Ce Cc A 





prividp Vi (Ap) ry (u;) zj vivj;Ap 
pU* U - 











Ap pu; / Ap Ap 
—vani(ve——"—( —) + psSzjv =0. (54) 
: ‘Uz U pU/z; ; *pU? 


Although (54) appears to be a partial differential 
equation for Ap, it can be rewritten as an ordinary 
differential equation along a ray. To accomplish this we 
choose 

A\=-— 1/2c?(p,7)! 


in (38) so that the equation of a ray becomes 
dx; Uj; 


= yo. (55) 
da c 





Then for any function g we have gz,[ v;— (u,/c) |=dg/do, 
where dg/do denotes differentiation with respect to ¢ 
along a ray. Making use of this fact, and observing that 
the ninth term in (54) vanishes since v2=1 and thus 
v;(v;)2;=0, (54) becomes 


d (Ap)? Uj; 
— log + (=) 
do pc cl xy 


1 ViVj 
-u(-) engi: Ol 


c c 








In obtaining (56) from (54) it was necessary to divide 
by Ap/U and to make use of (29) in the form U=c. 
Equation (56) can be solved at once, with the result 


(ap)? Op) ery ( mi 
pc -| pc eof | . “) 


1 ViV; 
+u(-) + (uf. (57) 


c c 








The first exponential factor 
exp f [ — (v,) 2; do 
o 


has a simple geometrical significance when u,;=0. 
In this case o denotes arc length along a ray, as is shown 
by (55). Then by following the method of Luneburg® 


3 See reference 2, p. 86. 








we find that 

ad 1 
exp f (-— v;)xdo= —, 
«0 K 


where K is the expansion ratio along the ray. This 
expansion ratio is the limit of the ratio of the normal 
cross-sectional area of a tube of rays at o divided by the 
corresponding area at oo, as the area tends to zero. 
Of course, the tube of rays is supposed to contain the 
ray in question, and to converge to it. Thus, when 
u;=0, (57) becomes 


(ap)? 1 (Ap)? 
ie os) 
pc K 70 


pc 


Since Ap/pc is the normal jump in velocity across an 
acoustic shock and ApAw represents the power per unit 
area in the shock, (58) asserts the conservation of shock 
front energy in a tube of rays. 

Equation (58) yields the shock strength Ap at any 
point on a ray in terms of its value at one point on the 
ray. Once Ap is known, Ap and Ai; can be immediately 
computed. Since (58) just inyolves K, Ap can be found 
once the rays are determined. It is useful to note that 
the normal cross section of a tube of rays is a portion 
of a wave front, and that the area of such a portion is pro- 
portional to the Gaussian curvature of the surface. 
Thus K is also the ratio of the Gaussian curvature at a 
point on a wave front, to the corresponding curvature 
of another wave front of the same family at a point on 
the same ray. 


E. Reflection and Transmission of an Acoustic 
Shock at a Contact Discontinuity 


We will now consider the reflection and transmission 
of an acoustic shock at a contact discontinuity in the 
basic flow. An incident wave front will, in general, give 
rise to one reflected and one transmitted wave front, and 
to a modified motion of the contact discontinuity. If 
these moving surfaces are represented in four dimen- 
sional x;, / space, they become stationary, three-dimen- 


Fic. 1. A shock impinging 
upon the interface between two 
media. Reflected and _ trans- 
fe) mitted shocks are shown. The 
numbers label the various 


'r*4 regions between pairs of sur- 

2 faces and the arrows indicate 

3 the directions of motion of the 
shocks. 





944 JOSEPH B. 





KELLER 


sional hypersurfaces. They are joined together on the 
contact discontinuity surface ', which we will represent 
parametrically, assuming I’ to be stationary, by the 
equations 


«i= gi(é,n). (59) 
The three wave fronts are given by 
W'—t=0, W®-t=0, W7-i=0. (60) 


Here, and below, J, R, T will denote incident, reflected, 
and transmitted, respectively. In order that these 
surfaces be joined together on I it is necessary that 
Wl=W*=WT7 onT, that is, 


W?(g)=W*(g)=W7(g,). (61) 


This is an identity in &, n, and may, therefore, be differ- 
entiated with respect to &, n, yielding 


(We;!—W2z*)gic=0, (W2;'—Waz;*)g;,=0, 


. . (62) 
(i xj! —Wz,7T)gi:=0, (Wz7—V zi?) gin=O. 


These equations show that the difference between the 
normals to W/ and W®, and the difference between the 
normals to W/ and W7 are parallel to the normal to I’. 

We will rewrite these conditions after introducing the 
wave normals p/, p*, and 7, and the unit normal .V 
to ’. Then the preceding equations imply 


p®=p'+aN, 
pt= pita. 


Here 4a, a2 are scalars. From (63) we see that p’, p*®, 
p7, and N all lie in one plane, called the plane of inci- 
dence. Forming the cross product of (63) with V, we 
obtain 


(63) 


p®XN=p!XN=)p'™XN. (64) 


If we now consider magnitudes in the above equations, 
and recall that the length of a wave normal is the re- 
ciprocal of the wave-front velocity along the normal, 
we obtain , 
— sino? =— sin@! =— siné’. (65) 
yk a! 9? 


The first equality is the law of reflection, and yields 
62®=a7—6! if v®=v!, which is the case when u;=0. 
The second equality is Snell’s law of refraction. The 
angles of incidence @/, reflection 0", and transmission 
6” are between p/, p*®, p’, and NV. Since 


67=sin—[ (v7/v?) sind? |, 


there is no value of 67, and, thus, no transmitted wave 
front, if (v7/v7) siné’>1, which is called the case of 
total reflection. 

The wave fronts and I divide space into five regions 
in the neighborhood of the point set J on which these 
surfaces are joined together. We assume that as J is 
approached in each of these regions, all of the acoustic 
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quantities have finite limits independent of the direc- 
tion of approach. If we designate these regions as 0, 1, 2, 
3, 4 (see Fig. 1), then the limit of each acoustic quantity 
will be indicated by a superscript corresponding to the 
region in which the limit is approached. We next intro- 
duce the jumps 


Auf=us—u’, Auj=uf—u}, Au?Z=u)—xu?, 
Aus=u?—u3, Aus=us—u;,, etc. (66) 


It is assumed that Aw,°, Ap®, Ap®, As® are known, since 
these are the jumps across the incident wave front. By 
applying the jump conditions, we wish to determine 
the other jumps, especially the reflected and trans- 
mitted ones Au,', Au,’ in terms of these. 

From the shock condition (24) at an acoustic shock, 
we have 


AuS=—v;"(pc)Ap®, Aut =—v,"(pc)Ap', 
67) 
Au3=—v;7(p'c’)“Ap'. 


Here p, c designate quantities on the “‘incident”’ side, 
and p’, c’ on the “transmitted” side of the contact dis- 
continuity. Also, from the definitions of the jumps, 
we have 


Ap’+Ap'+ Ap?+ Ap ?+ Ap'=0, (68) 
Au®+ Au;'+ Au?+ Aus+ Aus=0. (69) 


Since pressure is continuous across a contact discon- 
linuity, Ap?= Ap*=0, and (68) becomes 
Ap’+Ap'+ Ap*=0. (70) 


Now, we take the component of (69) normal to I and 
recall that the normal velocity is continuous across a 
contact discontinuity. Then we have V;,Au?= V,Auf=0 
and (69) becomes, after (67) is used, 


(pc)! cosb! A p+ (pc)! cosb® Ap! 
+ (p’c’)—' cos#TAP=0. (71) 


Equations (70) and (71) are two equations for the 
reflected pressure jump Ap! and the transmitted pres- 
sure jump —A?’*. The solution of these equations is 



































Pcos6! cos") 
pc pc’ 
Ap'= Ap mega ’ (72) 
cos@? cos6® 
pc pe - 
rcos6! cosd) 
pc pc 
—Ap=Ap’| ————__ I. (73) 
cos@? cos6® 
L pc’ ope 


The coefficients of Ap® in (72) and (73) are the reflec- 
tion and transmission coefficients, respectively. 

If «;=0 on the incident side of the contact discon- 
tinuity then v?=2! and (65) shows that 0?=x—8@/, 
provided N is oriented so that 0<6’<2/2. Then 
cos@” = —cos@/, and (72), (73) become 


; 1-Z : 
Ap “ia2"" (74) 
ition dad Pm cnoanstl (75) 
1+Z p’c’ cosé! 


Equations (74) and (75) are Fresnel’s formulas, usually 
deduced for reflection of periodic plane waves from a 
plane interface. 


F. Expansion Ratio for Straight Rays 


The expansion ratio K was introduced in (58) and 
determines the variation of the shock strength along a 
ray. It may be defined as the ratio of the Gaussian 
curvature of a wave front at one point on a ray to that 
at another point on the same ray. In a uniform basic 
flow with constant sound speed, the rays are straight 
lines orthogonal to the wave fronts. In this case the 
various wave fronts of a family of wave frontsare parallel 
surfaces, according to the definition of parallel surfaces 
employed in differential geometry. If Ri, Re are the 
principal radii of curvature of a surface at a point, then 
R,+Z, R2+Z are the corresponding radii of a parallel 
surface at a point at distance Z away on the same 
orthogonal (i.e., ray). Thus, 


RiR2 
K= , (76) 
(Rit+Z)(R2+Z) 





From (76) we see that K—' becomes infinite at two points 
Z=—R,, —R2, called conjugate points, on each ray. 
The locus of these points for a given family of wave 
fronts is called a caustic surface, which usually has two 
branches. The branches touch if on some ray Ri=R2. 
A point at which the two branches of the caustic surface 
touch is usually called a focus. Since K~' becomes 
infinite on a caustic, the acoustic shock strength also 
becomes infinite there. 

The expansion ratio has also been computed for a 
uniform basic flow with a curved contact discontinuity 
separating two regions of different constant sound 
speed. In this case the expansion ratio for reflected or 
refracted (i.e., transmitted) wave fronts becomes im- 
portant. For an incident spherical wave front these 
expansion ratios were computed by J. B. and H. B. 
Keller.‘ For an arbitrary incident wave front the result 
is much more complicated, and is given by Stanley 


‘J. B. Keller and H. B. Keller, J. Opt. Soc. Am. 40, 48 (1950). 








Preiser and Joseph B. Keller. Some parts of these 
results have also been given by others. 


G. Example: The Shock Tube 


As an example of the utility of the preceding theory, 
we will apply it to the shock-tube or one-dimensional 
explosion. The results are also useful in understanding 
some of the phenomena in spherical explosions. 

We suppose that all quantities depend upon x and / 
only and that only the x component of velocity, which 
we call u, differs from zero. The region —a<x<0 is 
supposed to be filled with a perfect, nonconducting gas 
at high pressure, and bounded at «=—a by a rigid 
wall. The region x>0 is supposed to be filled by a 
perfect, nonconducting gas at lower pressure. As the 
basic flow we take the state of rest: u=0, p=const, 
p=pi OF pox, S=S; OFr So, according as x>0 or x<0 
(see Fig. 2). 

We now assume that at /=0 the acoustic quantities 
vanish for x>0 and that for —a<x<0 they have the 
constant values p, 6, 8, a=0. Thus, at ‘/=0 the surface 
x=0 is an acoustic discontinuity. By Huygen’s con- 
struction we see that, if this gives rise to acoustic 
shocks at later times they will be located at x=cil 
or —C2t where ¢;, C2 are the sound speeds in the basic 














ISS 


x 


Fic. 2. An x—/ diagram of the shock tube. The x axis is hori- 
zontal, and at ‘=0 the high-pressure gas lies between a rigid wall 
at x=—a and a diaphragm at x=0, while the low-pressure gas 
lies to the right of x=0. At ‘=0 the diaphragm is broken and a 
shock travels to the right while a rarefaction travels to the left 
(solid lines). The rarefaction is reflected from the wall at x= —a, 
travels to the right, and is reflected and transmitted at x=0, 
which is the undisturbed boundary between the two gases. This 
process repeats periodically. The dashed line on the right is the 
location of the moving interface between the two gases. This 
interface moves with constant velocity between successive shock 
reflections. The vertical dashed line is the undisturbed location 
of the interface. 


5S. Preiser and J. B. Keller, Research Report No. EM-20, 
Washington Square College Math. Research Group, New York 
University, New York (1950). 
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flow for x>0O and x<0, respectively. Furthermore, 
the strengths of these shocks will be constant since the 
expansion ratio for a plane wave front is unity. 

To solve the problem, we assume that the solution is 
piecewise constant, the discontinuities occurring across 
the shocks and across the contact discontinuity at 
x=0. The acoustic quantities to the right of the shock 
x=c,l and to the left of the shock x= —cot have their 
initial values, while between the shocks the acoustic 
pressure and velocity will be denoted by j, a. These 
quantities are continuous across the contact discon- 
tinuity x=0, and we assume that they are constant. 

From the shock conditions applied at each shock we 
find 





A le (77) 
P22 Pic 
Thus, 

——— (pc): (78) 

ae a a é 
(pc) 1+ (pc)» 

1 

i= 9——_—. (79) 

(pc)1+ (pc)2 


The foregoing results (78), (79) will apply until the 
left shock reaches the rigid wall at x= —a, when a re- 
flected shock is introduced. Since @=0 at a rigid wall, 
the velocity jump across the reflected shock is the nega- 
tive of that behind the incident one. Therefore, the 
reflected pressure jump is equal to the incident pressure 
jump. This result also follows from the reflection formula 
(74) when p’c’ is infinite. The solution with one reflected 
shock applies until the reflected shock reaches the con- 
tact discontinuity, when it is partially reflected and 
partially transmitted according to (74), (75). The 
reflected shock will again be reflected at x= —a, and 
this process will continue indefinitely. 

The strength of the first shock traveling to the right 
of x=0 is given by (78), since the acoustic pressure in 
front of it is zero. We will denote the strength of this 
shock by ap, where a= (pc);[(pc)i+(pc)2}". Then the 
strength of the first shock traveling to the left is 
(a—1)p, which is also the strength of this shock after 
reflection from «= —a. The shock transmitted through 
the contact discontinuity at x=0 will have strength 
T(a—1)p, and the reflected shock will have strength 
R(a—1)p. Here R and T are the reflection and trans- 
mission coefficients given by (74), (75) with @/=0, 
6®=7. In this way we see that the strength of the mth 
shock traveling to the right of x=0 is R"*T(a—1)p for 
n> 1. 

The pressure , behind the mth shock traveling to the 
right of x=0 can easily be computed by summing all 
the pressure jumps across the preceding shocks. In 
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this way we find ~;=ap and 


1i— n—1 
Pr= [a+ T(a— »)-—- p= 2R"- n>2. (80) 


The velocity «, behind the mth shock traveling to the 


right of x=0 is then given by 
Un= Pn/ prc. (81) 


The contact discontinuity, at x=0 in the basic flow, 
moves a distance d,=(2a/c2)u, between the nth and 
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(n+1)st shocks. Thus, the total distance D, which it 
moves until the arrival of the (7+1)st shock is 





2a p T(a—1) n—1 
D.=—-* | nat ———(n—1- > R»| 
C2 pic 1—R k=l 
ap 
=—(1—R"). (82) 
p2C2" 


From (82) we see that D, oscillates if R<0, and D, 
varies monotonically with n if R>0. 
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We consider a body which under stress exhibits an elastic, a viscous, and a retarded elastic response. For 
the particular case of a constant velocity applied at the end of a semi-infinite bar, the transient stresses, 
strains and velocities are found by the method of characteristics. From this solution can also be obtained 
the stresses for the case of a constant stress applied at the end of a semi-infinite bar. By superposition we 
can also solve problems involving finite bars. For a constant velocity applied at the end of the bar an inte- 
gral expression is derived for the stress at that end as a function of time. 


1. FUNDAMENTAL CONSIDERATIONS 


EE and Kanter' found transient stresses and strains 
for a body whose stress-strain law could be repre- 
sented by a two parameter Maxwell model. We con- 
sider here a stress-strain law represented by a four 
parameter model as in Fig. 1. The stress-strain law is 
given by 
e=ertetes 


Eya=o 


1 
Ere2+ “roe (1) 
M2 
1 
—€3,1=0, 
M3 


where the subscript / denotes differentiation partially 
with respect to the time ¢. The correspondence to the 
model arises by considering o as the force applied at the 
point A causing an extension of the model which will 
be the strain «. €; will correspond to the extension of 


* Sponsored under a Department of Defense Contract, NOrd- 
11496 between the Bureau of Ordnance, Navy Department, and 
Brown University. 

+ Staff member at University of California, Los Alamos Scien- 
tific Laboratory. Formerly research associate in the Graduate 


Division of Applied Mathematics, Brown University. 

t Professor of Applied Mathematics, Brown University, Provi- 
dence, Rhode Island. 

1 FE. H. Lee and I. Kanter, J. Appl. Phys. 24, 1115-1122 (1953). 


the spring, €2 to the extension of the combination spring 
and dashpot in parallel, and ¢; to the extension of the 
dashpot.? , 

With the four parameter model we have added a 
retarded elastic element to the two parameter Maxwell 
model. With this added response it will be, in general, 
possible to approximate more closely the actual be- 
havior of a polymer. 

When a stress is applied to an amorphous high poly- 
mer there are microscopic and macroscopic motions. 
There is an instantaneous elastic response wherein the 
atoms move further apart, valence angles are changed, 
or valence bonds are stretched or compressed. There is 
flow as the molecules shift their centers of gravity and 
reach new equilibrium positions. There is also a re- 
tarded elastic phenomena. This is the unfolding of the 
long molecules so that they are oriented in the direction 
of the applied stress. 


%* 1 
Pe 
T1 


i 
E, ae 45° Bs A 
E> i ai 
ELASTIC LESS viscous 


RETARDED 
ELASTIC 














Fic. 1. Four parameter model. 


2 T. Alfred, Jr., Mechanical Behavior of High Polymers (Inter- 
science Publishers, Inc., New York, 1948). 
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When the stress is removed there will be two types of 
recovery: the instantaneous recovery as the valence 
angles and valence bonds return to their unstressed 
equilibrium positions, and the retarded elastic recovery 
as the molecules which were unfolded and uncurled 
return to their equilibrium state. The viscous flow is 
not recoverable as it is a result of the molecules moving 
to new equilibrium positions. 

In Fig. 2 we have compared the four parameter model 
with the Maxwell model and the elastic element. 

Let us first consider a creep test. A load is applied 
at time ‘=0 and this load maintained until time /= ¢». 
The elastic element is strained instantaneously and 
maintains the strain as long as the load remains con- 
stant. The Maxwell model has the same instantaneous 
elastic strain but also exhibits a constant flow rate. 
The four parameter model has the same instantaneous 
elastic strain and viscous flow but also has a delayed 
elastic response. The rounding off of the initial portion 
of the flow is due to this delayed elastic portion of the 
model which approaches the strain ¢/ E, asymptotically. 

Let us now consider unloading at time /=/o. The 
elastic element immediately returns to zero strain. The 
Maxwell model recovers the elastic portion of the strain 
but still has a permanent strain because of the viscous 
flow. The four parameter model also immediately re- 
covers the elastic portion of the strain and asymptoti- 
cally recovers the strain of the delayed elastic portion 
of the model. A permanent deformation is left because 
of the viscous flow.. 

Thus we can approximate more closely the actual 
behavior of an amorphous high polymer with a four 
parameter model since we can include the motion due 
to the unfolding and preferential orientation of the 
molecules out of their equilibrium state. 

With a random distribution of molecules there would 
be an entire spectrum of relaxation times for the re- 
versible elastic portion of the model. If so, we must 
consider the four parameter model as only an approxi- 
mation wherein we have picked some average value for 
a relaxation time which will give satisfactory results for 
a considered time interval.” 

In addition to the stress-strain law (1) we have the 
dynamic equilibrium condition 


Tr = Puy. (2) 
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We also use the definitions of «, 7 


Ou Ou 
e=—, r=—- (3) 
Ox al 


Equations (1), (2), and (3) comprise the system of 
equations to be solved in conjunction with appropriate 
boundary conditions. If we put these equations in 
dimensionless form we obtain an equivalent problem to 
solve in which the five physical constants have been 
combined to yield two parameters. Eliminatng ie, €; 
from (1), (2), and (3) we put the resulting equations 
in the dimensionless form 


€2*+ €2, : = o*, 











€,*=0;*, 
{ (4) 
o:*=1,", | 
€,*— €, ,*—ao*=Be,*, ) 
where 
1 1 
.,= ~§, t= wane, 
b2(pE2)* bok» 
Ey E, : Ey { = 
e= —¢, = €2*, a= E\o*, vt= me (5) 
Ee» E» (pE2)} 
Ms 42 
ers p=—- 
Me Ey 


By simple elimination of variables we can find the 
partial differential equations satisfied by the dimension- 
less stress, strain, and velocity.’ It is found that these 
three satisfy the same partial differential equation so 
that if we find the transient stresses for a constant 
stress applied at the end of a bar we will also have the 
solution of the problem of finding the transient veloci- 
ties for a constant velocity applied at the end of the bar. 

We are primarily interested in the transient stresses 
which occur in the body and we can find the most 
information about the stresses if we find the stress and 
velocity distribution for a constant velocity applied at 
the end of the bar. As mentioned above, we have thus 
automatically solved the problem of finding the stresses 
for a constant stress applied. We note that in the char- 
acteristic solution the values of stress and velocity at 
a point are found simultaneously in constructing the 
solution. 

The numerical characteristic solution is found for 
values of a=1, 8=1 in which case all elements of the 
four parameter model have a significant effect. A check 
of the characteristic solution has been found in the form 
of an integral which represents the stress at the end of 
the bar where the constant velocity is applied. 


3 See Appendix I for the partial differential equations. 
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2. CHARACTERISTIC EQUATIONS 


The system of Eqs. (1) is a hyperbolic system of 
linear partial differential equations and can be numeri- 
cally integrated by the method of characteristics under 
appropriate boundary conditions. We can immediately 
eliminate e,* from (4) to give 


€2 7*=a*— €* 
o;*—v,*=0 (6) 
€2, *+Be,*— v:* = —ao*. 


The dependent variables o*, «.*, and v* are functions 
of both € and 7 and can be considered graphically as 
surfaces in space, e.g., o*, ~, 7 space. The functions 
o*, €:*, v* are well behaved in the interior of the region 
bounded by the é axis and the line (6)!}—7=0, where 
8 has been defined in (5). Consequently their deriva- 
tives will take on particular values as we move along 
some curve in this region. 








We have 
do* do* 
da* = —d§+—dr 
0& Or 
Oes* Oe* 
de.*= di+—dr (7) 
0g Or 
ov* ov* 
dv* = —di+—dr. 
dé Or 


The systems of Eqs. (6) and (7) together form a set 
of six equations in the six unknowns 00*/d£, do*/dr, 
deo*/OE, Deo*/Ar, Av*/dE, Av*/I7, and we can solve for 
each of them. For example, using determinants to solve 
for do0*/d& we have 


o-a* 0 0 1 0 O 
0 0 0 0 O -1 
-—o* pg 01-1 O 
do* dr 0 
deo* 0 dt dr 0 0 


—) 
—_ 
al 
—) 





do* dv* 0 0 O dé dr 
a “| @) 

0& a. & 2 F 0 0 

10 0 0 0 —-1 

0 pg 0 1 —-!1 C 

dé dr 0 O 0 0 


0 0 dt dr 0 0 
0 0 0 0 dt dr. 








The characteristics are defined as those curves in the 
—r plane along which the derivatives d0*/d£, do*/dr, 
Jeo*/OE, Jes*/Ir, Ov*/dE, Av*/Ar are not uniquely deter- 
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mined.‘ For do*/dé to be finite and indeterminate we 
must have both determinants of (8) equal to zero. 
Setting the denominator of (8) equal to zero yields the 
equation determining the characteristics: 


dé[ (8)'dé-+dr ][ (8)'dt—dr ]=0, (9) 


from which we obtain the equations of the three sets of 
characteristics : 





€=constant 
1 
&+ 7=constant 
(8)! (10) 
1 
&———r=constant. 
(8)! 


Setting the numerator of (8) equal to zero yields the 
condition 


dt drdv* —Bdo*dé+ e.*dtdr— (1+a)o*dtdr ]=0. (11) 


We desire to have this expression identically satisfied 
on all the characteristics. (11) is obviously satisfied 
along the set of characteristics §=constant. Along the 
characteristic {—1/(8)!r=constant, we have the neces- 
sary condition 


(8)4dv* — Bdo* =[ (1+a)o*— €: |dr. (12) 
Similarly, on §+1/(8)!r=constant we have 
(8)3dv*+Bdo* = —[(1+-a)o*— es |dr. (13) 


In a similar manner we solve for the other partial de- 
rivatives of o*, ¢.*, and v* and obtain the additional 
information that along the family of characteristics 
£=constant we must have 


de.* = (o* — €9*)dr. (14) 


Equation (14) could also have been obtained by inte- 
grating the first equation of (4) in the 7 direction. In 
(10) we have the equations of the families of charac- 
teristics and using (12), (13), and (14) we can numeri- 
cally find the values of o*, ¢:*, and v* by integrating 
along the characteristics from the known data on the 
boundary. We shall find the boundary conditions which 
hold along the wave front and also complete the setting 
up of the problem in section 3. 


3. WAVE-FRONT BOUNDARY CONDITIONS 


According to the theory of characteristics and the 
zero initial conditions, a point & of the bar will be 
unstressed until the stress wave reaches that point at 
time 7o= (8)*&. At time 7» there will be a jump in the 
stress and velocity at the point &. White and Griffis® 


*R. Courant and D. Hilbert, Methoden der Mathematischen 
Physik (Verlag von Julius Springer, Berlin, 1937), Vol. 2, pp. 
110-122. 

5M. P. White and L. Griffis, J. Appl. Mech. 69, A-337-A-343 
(1947). 
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have evaluated this jump at the wave front by equating 
the impulse and change of momentum. Putting their 
result in dimensionless form we have 


Av* = — (8) !Ao*. (15) 


Also, from Malvern® we have 


1 
Av* = ——Aé*. (16) 
(8)! 


We consider the body to be initially unstressed and 
unstrained, i.e., initially o* = e*=v*=0. We can inte- 
grate (15) and (16) to obtain the relation holding just 
after the wave passes 


=-—. (17) 


From physical considerations, or as implied by the 
first of (4) for finite o*, we see that 


«.*=0, (18) 


just after the wave passes since this is the strain corre- 
sponding to the retarded elastic element of the four 
parameter model. The wave front corresponds to the 
characteristic §= 1/(8)!r in the £—7 plane, and putting 
(17) and (18) in (12) yields after integration 


saen|-(F>)*] 
ao" =C, ex —_ 
p 28 T 
1+a 
e*=c8 exp| - ( )-| (19) 
28 
1 
v*=—c¢y/B exp| — ( >| 
28 


which are the boundary conditions along the wave front. 
To construct a characteristic solution we must addi- 
tionally prescribe either the stress, velocity, or displace- 
ment at the end of the bar. 
If the stress is prescribed at =O it is easily found 
that along §=0 











o* = fo 
e*= fo(ar—e-* +148) 
€o* = fo(1—e~”) (20) 


ve" = folate’). 


Similarly, if the velocity is prescribed at —=0 we have 


that on £=0 


v* = — go 
o:*=0. 


(21) 


*L. E. Malvern, The Propagation of Longitudinal Waves of 


Plastic Deformation in a Bar of Material Exhibiting a Strain- 
rate Effect, Technical Report, A11-39, Graduate Division of 
Applied Mathematics, Brown University, Providence, Rhode 
Island (1949), 


GLAUZ AND E. H. 
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In the numerical work it is only necessary to use the 
first equations of (20) or (21) since we have two char- 
acteristics with characteristic relations on them along 
which to evaluate the other two unknown functions. 
We have computed the characteristic solution for 
a= 8=go=1. For this choice of the parameters all four 
elements of the model give significant strains. This is 
in contrast, for example, to a choice of very small a in 
which the viscous element would be negligible for short 
or medium time intervals. The choice of unity for go is 
purely to obtain convenient magnitudes for the dimen- 
sionless variables. Since the problem is linear, another 
choice merely changes the values of all dependent vari- 
ables proportionately. , 

From the nature of finding the characteristic solution 
by working from the origin outward it is desirable to 
have an accurate analytic representation for o*, €2*, v* 
in the neighborhood of &=7=0. For small &, rt a power 
series in these two variables converges rapidly and gives 
satisfactory values of the functions. Let us assume that 
o*, e.*, «* have approximate polynomial representations 
of the form: 


o* = a,+det+asrt+ ast asir 

+ agr?+ a78+ ast?r+ agét?+ 107° 
€2* =), +--+ ++d07° (22) 
F=ey+--++c,07°. 


For the case of a constant velocity applied at the end 
of the bar £=0 we find by substitution in ‘the partial 
differential equations, and using the wave-front bound- 
ary conditions that 


vi 


























a im _ (+1), G7B+) 
*L@ @! 4@) 48@) 
(B+y+1) Gv'8+3y+1) ) 
4(@)! 128(8)! 
1 (8)*(y—1) (y+1) . 
teal t+ 24 Er — r 
(6)! 2 2(8)! 
(2+a—8) (y8+2+a—28) ; 
aa oo &r—ttr 
6 4(8)! 
(23) 
ee} 
126(8)} 
v*= |-1+@) este e 
80 i 2 28)! 
(3—3y—By’) (y*8+7+1) 
} a | nan 
+(8) } 16) ee| 


1+a 


26 
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1.4 
POLYNOMIAL SOLUTION FOR  =8=Qo=! 

12 

Lo LEz0-0! 

T=0.05 
TO.) 
T02 
0.8 - 
o* 
Fic. 3. 

0.6 
0.4 
0.2 

° 

° ao2 0.04 0.06 0.08 ao & 0.12 014 0.16 0.18 0.20 


‘In Fig. 3 we have plotted curves found by evaluating 


o* in (23) for the special case of a=8=go=1. 

The characteristic solution was built outward using 
a linear interpolation between points and the following 
lengths of intervals along (@)!4§—7r=0 


0<7r<3 dr=0.1 
3<7r<6 dr=0.2 
6<7r<10 


dr=0.4. 


The size of the intervals can be increased as the solu- 
tion proceeds, because the surfaces become smoother 
and the error of using linear interpolation decreases. 
Malvern® has given a description of the complete nu- 
merical method. 

The solution is given in the sets of curves in Figs. 
4-9, Comparison of these curves with those in reference 
1 shows the effect on the stresses and velocities of the 
addition of the retarded elastic element. For short time 


















































Fic. 4. Characteristic solution for a=8=go=1. 


the curves are almost identical as the viscous portion 
of the retarded elastic element has not allowed appre- 
ciable motion. For a longer time however the retarded 
elastic element has flowed relieving some of the stress 
of the purely elastic element and giving a reversal of 
curvature to the stress curves. 

After a longer time the effect of the retarded elastic 
element is reduced. This is easily seen from inspection 
of the graphs of ¢:*. The retarded elastic strain ¢€* at 
the end of the bar starts at zero and rapidly increases 
to a maximum. This is the point where the retarded 
elastic element has its largest smoothing effect on the 
stress. €:* then decreases. 

In order to obtain a more complete picture for the 
four parameter model, wave propagation will have to 
be evaluated for other values of a and 8. However, 
until measurements of these values for specific materials 
have been made this seems hardly worthwhile, the 
present example being presented as a typical case. 





Fic. 5. Characteristic solution for a=8=go=1. 



































Fic. 7. Characteristic solution for a=8= go= 1. 


This solution can be utilized to determine the stress 
distributions in finite rods by means of the image 
method discussed in reference 1. This development can 
be handled graphically, and no further computations 
are necessary. 





* 


Fic. 8. Characteristic solution for a=8=go=1. 
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Fic. 9. Characteristic solution for a=B=gy=1. 
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4. ANALYTIC SOLUTION 


Several methods of effecting an analytic solution 
have been tried but the solutions are much too compli- 
cated to yield results which can be reasonably evaluated. 
The characteristic solution has been found to be the 
quickest method to obtain results. However an analytic 
expression for the stress at the end of the bar as a func- 
tion of time has been found to check the characteristic 
solution. 

For the case a=8=1 we have obtained the following 
result using the Laplace transform method: 


=\1 
) 


(5 
o* (0,7) =exp(—3r 210 —-r) 
? 


ta (5)! 
+f exp(—A— T, 2»10(—2) 


XK {Jol (7 -\) J4+- 04 (7A) Jar. (24) 
The derivation is given in Appendix II. To check our 
characteristic solution, (24) has been evaluated for 
7=10 and the value 0.194 was obtained as compared 
to the value found in the characteristic solution of 0.193. 
This indicates that the characteristic method of solu- 
tion for this particular type of problem gives very 
accurate results. 

Equation (24) can be generalized to arbitrary a, 8 
very easily. 

APPENDIX I 


By eliminating all unknowns but one, we solve for 
o*, e* in (4) to find that they satisfy the same partial 
differential equation 


Bo rre*—oeee* + (1+at+B)o,,*—o¢:*+a0,*=0 


(A.1) 
Beésrr*— €zee* + (140+ 8) €,,*— €¢:*+a€,*=0. 
We also have from (4) that 
o*= Y,", (A.2) 


and hence from (A.1) 
Bsere*—Vg¢er* + (1t-at+B)vers*+0e¢r*+tats:*=0. (A.3) 
Integrating with respect to 7 gives 


Bves,"— Vets + (i+a+8)0,,*— vee*+av,*=0, (A.4) 
where we have used the conditions that the body is 
initially unstressed and unstrained with zero velocity. 

The fact that all of the dependent variables satisfy 
the same differential equation gives us much additional 
information when a single characteristic solution is 
found. Because of the range of solutions to be investi- 
gated in problems with two variable parameters any 
additional information without computing another solu- 
tion is highly desirable. 








plic 


a, B 


to t 


We 


For 


The 


i* = 





or 


or 
ial 


is 
ty. 
sfy 
nal 


sti- 


ny 
lu- 





TRANSIENT WAVE ANALYSIS 953 


APPENDIX II 


We use the Laplace transform method and for sim- 
plicity will consider the case of a=6=1. The general 
a, 8 case can be considered similarly. 

We apply the Laplace transform 


p~kige f exp(—pr)e(r)dr (B.A) 


to the set of Eq. (6) and obtain after simplification 


G3" -(tee Jaro (B.2) 


o:*= po*. (B.3) 
We have used the initial conditions that 
o* (£,0) = e2* (£0) = €* (E,0) =0*(£,0)=0. (B.A) 
For boundary conditions we have that 


v* (0,7) ete 


(B.5) 
lim o*(é,7)=0. 
These boundary conditions transform to 
D*(0,p) ———s 
(B.6) 


lim *(t,p)=0. 
Solving (B.2), (B.3) using (B.6) we have 


&* = gy exp— (tor) / (torr) 
. i+p l+p 


(B.7) 
P° 
i*= — £0 €xp— ioe p+ r) é. (B.8) 


From (B.7) we find that for the end of the bar 











go(1+p)! 
6*(0,p)= — (B.9) 
(p)!(p+a)?(p+5)! 
where 
3 (5)! 3. (5)3 
a=-— , b=- : 
62 oe 


Formally, the inversion integral gives 
go putin er(A+1)3 

I=o* (0,7) =— dd, (B.10) 
2rid yi (A)}(A+a)?(A+5)! 


where we have —4<argA<r. It is easily seen that 





) 
T=e—'—(e'l’), (B.11) 
al 
with 
bo y+ ix et 
]'= dy. (B.12) 
2rid in (A)? (A+1)4(A+a) (+48)! 


From Churchill,’ 


a—b | 1 
Lever —1) - , (B.13) 
2 J) (pta)i(pt+oy! 


and consequently we have from the convolution theorem 
that 


t 5)3 
I'=g0 exp(—1/2) f on( >) 


XIof(t—d) dd. (B14) 








Hence, 


d 
o* (0,7) =exp(— 37, a+) 


1/7’ (5)! 
+-f exp(—A— 7/2) 
2/9 2 


XE (r-\VI4+HEH(r—d) Yad. (B.15) 
7R. V. Churchill, Modern Operational Mathematics in Engineer- 
ing (McGraw-Hill Book Company, Inc., New York, 1944), p. 297. 
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Thermionic Emission and Electron Diffraction from Thin Films of Barium Oxide* 


Paut N. RussEttf AND A. S. EISENSTEIN 
Department of Physics, University of Missouri, Columbia, Missouri 
(Received September 1, 1953) 


A high-vacuum, sealed-off glass, electron diffraction tube was developed for observing the physical struc- 
ture of evaporated BaO films on a nickel substrate. Simultaneously, measurements of thermionic emission 
could be made. The thermionic activity of the deposited film increased until approximately 20 monolayers 
(measured using a radioactive tracer technique) had been laid down, and no further increase was found up 
to 50 monolayers. This emission density (0.4A/cm? at 1000°K) is comparable with that obtainable from 
sprayed oxide coatings. With the receiver at less than 450°K the oxide film showed an amorphous structure. 
With the receiver at 800°K the film was crystalline and was believed to exist in clumps. This crystalline 
structure became amorphous if the films were heated to 1070°K and this form persisted as the oxide film 
was removed by evaporation. The relation of the thermionic emission from thin BaO Sims to that from 


sprayed oxide cathodes is discussed. 





INTRODUCTION 


NE salient feature of all previous studies'~ of the 

thermionic emission capabilities of monolayer 
films of evaporated BaO is the close similarity between 
both the magnitude and the temperature dependence 
of this emission and that of thick, sprayed oxide- 
coated cathodes. This similarity has given rise to 
speculation that the emission mechanism operating in 
these two physically different structures may in fact 
be similar. 

Davisson and Pidgeon,' who first studied the electron 
emission from thin BaO films found that optimum 
emission occurred for a coverage of about one mono- 
layer. However, their attempt to interpret the details 
of the emission vs thickness characteristic was based on 
a model in which the oxide was not uniformly dis- 
tributed but was laid down in clumps. Moore and 
Allison* deposited by evaporation monomolecular 
films of BaO and SrO onto tungsten and molybdenum 
surfaces and observed a monotonic increase of emission 
with thickness through twenty-five molecular layers. 
The work function of these surfaces was found to de- 
crease rapidly for. the first one-tenth monolayer and 
then only gradually, reaching a limiting value of about 
1 ev after a few monolayers were deposited. An emission 
current of nearly 1 A/cm? at 1000°K could be obtained 
from these monomolecular films. A model consisting 
of a uniform distribution of oriented molecular dipoles 
was assumed and was used to explain the observed 
emission phenomena. 

X-ray diffraction techniques have been used ex- 
tensively in studies of the physical structure of the 
oxide cathode coating and the interface region. Be- 
cause of its reduced penetration and greater scattering 
efficiency, a beam of electrons diffracted from a sample 
should provide more information about the nature of 
the surface layers than is obtainable from x-ray dif- 


* Supported in part by the U. S. Office Naval Research. 

t Now at General Electric Company, Syracuse, New York. 

1 Davisson and Pidgeon, Phys. Rev. 15, 553 (1920). 

2A. W. Hull, Phys. Rev. 56, 86 (1939). 

3G. E. Moore and H. W. Allison, Phys. Rev. 77, 246 (1950). 


fraction studies. Nevertheless, electron diffraction from 
oxide cathode surfaces has contributed little new in- 
formation beyond what has been obtained with x-ray 
scattering. 

The vacuum conditions found in most electron dif- 
fraction units are so poor that, although the oxide 
sample is usually prepared by conversion from the 
carbonate within the unit, the samples are hardly 
typical oxide coated cathodes. Only Darbyshire* re- 
ports finding BaO on the surface of cathodes converted 
from BaCO;. Other studies®:* produced patterns of 
Ba(OH), and BaS which may have resulted from 
contamination of the oxide surface by a reaction with 
residual vapors. Whereas most low-energy electron 
diffraction studies are carried out in thoroughly 
evacuated, sealed-off tubes, this technique does not 
seem to have been used extensively for high-energy 
electron diffraction work. Darbyshire’ has described a 
sealed-off, diffraction tube built for demonstration 
purposes. This resembled a cathode-ray tube, and the 
diffraction pattern was viewed on a phosphor screen. 

As a preliminary to the work reported here an in- 
vestigation® was made to compare various means for 
detecting and recording electron diffraction informa- 
tion that did not involve the use of photographic 
plates within the diffraction chamber. The technique of 
allowing the diffraction patterns to fall on a phosphor 
screen within the evacuated chamber and recording 
the pattern photographically with a camera external 
to the vacuum was found to be quite satisfactory. 
Since quantitative measurements of diffraction inten- 
sities are seldom required, a calibration of the system 
is not necessary. 

A study of the thermionic emission characteristics of 
thin BaO films was ‘undertaken in a high-vacuum, 
sealed-off, electron diffraction tube in which observa- 


( 4J. A. Darbyshire, Proc. Phys. Soc. (London) 50, 635, 964 
1938). 

‘8H. Gaertner, Phil. Mag. 19, 82 (1935). 

6 Huber and Wagener, Z. tech. Phys. 23, 1 (1942). 

7 J. A. Darbyshire, Elec. Eng. 22, 385 (1950). 

8 A. Eisenstein, paper presented at ASXRED summer meeting, 
1948. 
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THERMIONIC EMISSION AND 


tions of the physical structure of the film could be 
correlated with measurements of the electron emission. 
Changes of both structure and emission accompanying 
heat treatment of the sample could be followed as the 
BaO film was removed by evaporation. 


EXPERIMENTAL PROCEDURES 
I. Determination of Evaporation Rate 


Although the vapor pressure of BaO is well known,’ 
the rate of deposition was verified experimentally 
using a structure identical with that to be used in the 
final diffraction tube. This preliminary experiment was 
carried out to avoid possible error in the calculated 
amount of deposited BaO which might result from a 
temperature gradient in the evaporator or from an 
inaccurate calibration of the monitoring thermocouple. 
At the temperature used for evaporation, 1310°K, the 
rate of deposition doubles for a temperature change of 
about 15°. 

The deposition rate was determined by the use of 
radioactive tracer techniques in an evacuated tube. 
Ba'CO;{ was applied to a platinum§$ evaporator 
structure identical with that used in the diffraction 
tube. Above the evaporator at a distance of 7 mm was 
mounted a plane nickel shield with a 5-mm hole 
punched in its center. Above the hole was suspended 
one of several nickel sample buttons which could be 
manipulated into position for a given time of evapora- 
tion and could then be removed, making way for 
another. During a period of evaporation the tempera- 
ture of the evaporator was held constant at 1310°K 
as measured by a Pt—Pt+10-percent Rh thermo- 
couple. When each sample had been exposed to the 
molecular beam for a definite time ranging from 2 
minutes to 10 hours, the tube was cracked open and 
the radioactivity of each sample was compared with 
that of a standard aliquot of the original BaCO;. The 
plot of Geiger counts per minute vs evaporation time 
on a log-log scale gave a straight line with a slope of 
unity, as shown by the upper curve in Fig. 1. 

Closer examination of the data revealed that the 
count due to the radioactivity of the first button, which 
had received a 20-minute deposition, was about 30 
percent greater than the count of the last button which 
had also received a 20-minute evaporation, but 19 
hours later. Ba’ with a half-life of 13.4 days decays 
into La™ with a half-life of 40.2 hours. In the aliquot 
and on the evaporator the lanthanum is in transient 
equilibrium with the barium, but due to the refractory 
nature of La,O; it is reasonable to assume that no 
significant amount of this oxide is evaporated at 
1310°K, only BaO. The lower curve of Fig. 1 results 


¢ Prepared from Ultra Pure Barium Carbonate, Mallinckrodt 
Chemical Works, St. Louis, Missouri. 

§ 99.9-percent pure Pt obtained from J. T. Baker Co., Newark, 
New Jersey. Evaporator structure outgassed at 1500° K in 
vacuum before applying the BaCO3. 
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Fic. 1. Counting rate produced by Ba™0O reaching the collector 
plate as a function of the evaporation time at 1310°K. Lower 
curve corrected for activity of daughter La™. 


from correcting the observed counting rates and 
brings the two 20-minute depositions to within 7 per- 
cent. Furthermore, the corrected deposition rates are 
found to agree within a factor of two with the rates 
computed from the vapor pressure of BaO and geo- 
metrical considerations. For convenience the term 
“monolayer” will be used to represent an average 
surface density of 10'° BaO molecules/cm?, whether 
the molecules form a true monolayer or aggregate at 
isolated regions of the surface. Figure 1 shows that 
one monolayer is deposited in 27 minutes at 1310°K. 


II. Diffraction Tube Construction and Operation 


Preliminary studies of thermionic emission from 
evaporated BaO films were carried out in evaporator- 
diode tubes having the same geometry as that used in 
the electron diffraction tubes. The results presented 
here are based on the performance of five evaporator- 
diode tubes and four electron diffraction tubes. Only 
the electron diffraction tube will be described in detail. 

This tube, shown in Fig. 2, consisted of an elongated 
cathode-ray tube blank with a commercial electron 
gun of the type used in the 10F P4 tube!! at one end and 
a phosphor screen of the P4 variety in the other end. 
The evaporator-diode structure was placed in a cross 
arm midway between the gun and the screen, in such a 
position that the polished nickel sample would inter- 
cept the electron beam at near grazing incidence. 

The evaporator-diode structure consisted of a 
polished receiver of pure electrolytic nickel{ indirectly 
heated by a tungsten coil. The temperature was meas- 


|| These guns were kindly supplied by J. J. O’Callaghan, 
Rauland Corporation, Chicago, Illinois. 

{{ 1001 Nickel obtained from E. M. Wise, International Nickel 
Company, New York, New York. 
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Fic. 2. Complete electron diffraction tube showing electron gun, evaporator-diode structure, and phosphor viewing screen. 


ured by a Mo-Ni thermocouple. Immediately above at 
a distance of 17 mm was mounted the platinum faced 
evaporator. This evaporator was indirectly heated and 
its temperature was measured by a Pt— Pt+ 10-percent 
Rh thermocouple. Between the evaporator and the 
receiver was located a hinged tantalum anode which 
could be rotated out of the way during evaporation of 
the BaO and also during the observation of the electron 
diffraction pattern; but which in its closed position 
served as a shield for the clean nickel receiver surface 
during conversion of the BaCO; to BaO, and as an 
anode for the measurement of thermionic emission 
from the thin deposit of BaO. 

The metal components of this structure were in- 
dividually vacuum degassed or hydrogen fired before 
assembly. The platinum evaporator was placed be- 
hind a mask which had a 0.1-cm? aperture, and was 
sprayed with a water suspension of Ultra Pure BaCO; 
to a surface density of 15 mg/cm*. The structure was 
immediately assembled and sealed onto the electron 
diffraction tube. The tube was then evacuated, proc- 
essed, and sealed off at a pressure of about 5X 10-* mm 


of Hg, and the Batalum getter flashed. It was then 
“aged” by applying successively higher voltages to the 
accelerating electrodes until it would operate stably at 
well over 25 kv. During this process the pressure as 
measured by an attached Bayard-Alpert ionization 
gage did not rise perceptibly. The pressure after aging 
was less than 1X 10~-* mm of Hg. 

The electron beam was brought to a focus on the 
phosphor screen by means of a commercial electro- 
magnetic focusing coil. Since the sample was rigidly 
mounted in the tube, it was necessary to deflect doubly 
the electron beam to the sample at grazing incidence. 
The resulting diffraction pattern was photographed 
externally by means of a Dumont 35-mm oscilloscope 
camera, f/3.5, using Eastman Super XX film. Ex- 
posures of 2, 4, and 8 minutes were made of each 
pattern, and the film was developed before the physical 
state of the sample was changed. Halation rings, 
usually present because of the primary beam of high- 
energy electrons impinging on the phosphor and 
producing light which is partially reflected internally 
in the glass face plate, were eliminated by removing 
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the phosphor from a small area of the screen in that 
region during the construction of the tube. The film 
was then viewed, and the radii of the rings measured 
with a microprojector. Calibration of the over-all 
optical magnification was accomplished by photo- 
graphing an opaque patch of known dimensions which 
had been secured to the outer surface of the face plate. 


III. Electron Emission Measurements 


Thermionic emission currents from the cathode at a 
given temperature were measured for anode potentials 
from 20 volts positive to several volts negative and 
were used to construct retarding potential plots from 
which the zero field emission current densities J) were 
obtained. This procedure was repeated for a series of 
cathode temperatures measured in descending order. 
A Richardson plot representing the particular state of 
activation of the cathode was made and used to 
evaluate the work function and the characteristic 
emission of the cathode at 1000°K. 


EXPERIMENTAL RESULTS 


In its initial state, the thermionic emission of the 
“clean” nickel was too low to be measured with ac- 
curacy with the available equipment; it was of the 
order of 10-" amp/cm? at 1000°K. The receiver was 
held at 800°K while one monolayer of BaO was de- 
posited. The subsequent temperature dependence of 
emission is shown by curve A of Fig. 3. When the sample 
was heat treated for 10 minutes at 1070°K the emission 
characteristic fell to curve B. Further heat treatment 
of the sample for 10 minutes at 1170°K gave curve C. 
In this state the electron diffraction pattern of the sur- 
face corresponded to that of nickel rather than BaO, 
and further heating did not further reduce the emission. 
The subsequent curves in Fig. 3 represent two complete 
repetitions of this cycle. The close similarity of these 
three sets of curves indicates the reproducibility of the 
electron emission characteristics of deposited BaO 
films on a nickel substrate. When this procedure was 
repeated using 3 monolayer depositions, longer heat 
treatments of the sample were necessary to reduce the 
emission, but in no case was it reduced below that of 
curve C. 

More than 50 sets of photographs were made of the 
diffraction patterns produced in this tube representing 
different physical states of the emitting surface. For 
brevity and convenience all diffraction patterns ob- 
served in this study may be classified into 4 types and 
combinations thereof: (I) “Laue type” pattern; (II) 
broad ring pattern, some rings exhibiting preferred 
orientation; (III) sharp ring pattern; and (IV) diffuse 
pattern. These are shown in Fig. 4. 

Type I was observed only from the surface of ‘‘clean”’ 
nickel either before deposition of BaO, or after a heat 
treatment sufficiently severe to cause it to re-evaporate. 
Photo micrographs and electron micrographs, using 


the replica technique, were taken of the polished nickel 
surfaces. Crystals having an average dimension of 
0.025 mm are present in the nickel substrate; hence, 
as many as 10* crystals are bathed in a 1-mm? beam of 
electrons grazing the surface. Of this number perhaps 
20 are properly oriented to satisfy the Laue conditions 
for diffraction and produce the spots observed in this 
type of pattern. The presence of this type of pattern 
and the low emission represented by the Richardson 
plot Fig. 3, curve C, were the criteria for a ‘“‘clean” 
surface following the re-evaporation of the BaO. 

The Type II pattern was observed after the deposi- 
tion of one or more monolayers of BaO onto a receiver 
held at 800°K. Since it is generally believed that 
electron diffraction patterns are not observable from 
crystallites having less than five or ten diffracting 
planes, this deposit is probably not a true monolayer. 
Rather, it is a “patchy” deposit in which only portions 
of the surface are covered by the BaO and those 
portions are covered to a depth of several molecular 
layers. It will be remembered that one deposited 
monolayer was defined as 10'° molecules per cm? 
irrespective of its physical arrangement. Measurement 
of the radii of the diffraction rings indicated that in 
every case the deposited material had a face centered 
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Fic. 3. Typica! Richardson plots of the thermionic emission 
from BaO deposited on a pure nickel base receiver. (A) One 
monolayer deposition; (B) after heat treatment for 10 minutes 
at 1070°K; and (C) after 10 minutes at 1170°K. [Process re- 
peated showing reproducibility, in sequence (D), (E), (F), (G), 
— and (J). Type of diffraction pattern observed shown at 
top. 
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Fic. 4. Four types of electron diffraction patterns observed: 
(I) “Laue type” pattern of the nickel receiver; (II) broad ring 
pattern of BaO showing some preferred orientation; (III) sharp 
ring pattern of BaO; and (IV) diffuse pattern of BaO. 
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cubic structure with a lattice constant of 5+1A. 
Barium oxide is face centered cubic with a lattice 
constant of 5.53A. For thin deposits the resulting 
pattern showed variation of intensity along a given 
ring. This was attributed to a preferred orientation of 
the deposited crystallites with the (1,0,0) planes of the 
BaO coinciding with the nickel surface. Patterns from 
thicker deposits showed less preferred orientation, 
indicating that the orientation of the upper layers 
does not necessarily correspond to that of the first 
deposited layers and may thus represent the growth of 
new crystallites. The breadth of the diffraction rings 
indicates that the crystallites were very small, prob- 
ably less than 100A in size, or that the lattice was 
rather imperfectly developed, or both. 

The type III pattern was similar to type II, but 
differed from it in that the rings were quite sharp, in- 
dicating crystallites larger than 100A in size and having 
no preferred orientation. This pattern was not ob- 
served when BaO was deposited on the pure nickel 
substrate, even for deposits of up to 50 monolayers. 
The sharp ring pattern was found only in some sub- 
sequent depositions on nickel alloy substrates and is 
shown here as evidence that thick films of BaO having 
well-developed crystals may be obtained by evapora- 
tion and detected by electron diffraction. 

The very diffuse type IV pattern was observed under 
two conditions, (1) when BaO was deposited on a re- 
ceiver held at a temperature less than 450°K, and (2) 
as a result of very high temperature heat treatments of 
a crystalline BaO film. This diffuse scattering of the 
impinging electrons is probably produced by an 
amorphous layer of BaO on the metallic substrate. If 
the temperature is sufficiently low that surface migration 
does not occur, a regular lattice array of the deposited 
molecular dipoles is rather improbable. Subsequent layers 
on this substrate likewise may have a random arrange- 
ment and an amorphous structure would be formed. 
Electrons scattered from such a random arrangement 
of molecules would produce this diffuse pattern. 

‘Combinations of these patterns also appeared. 
Ordinarily, BaO deposited upon the base metal gave a 
ring-type pattern. With continued heat treatment of 
the sample the type II pattern seemed to fade into 
type IV, which in turn changed into type I. The 
Roman numerals bracketing the curves of Fig. 3 
indicate the type of pattern which was observed for 
each state of the emitting surface. 

The emission obtained from successive depositions 
of BaO is shown in the Richardson plots of Fig. 5. 
During this deposition the receiver was held at 800°K. 
It will be noted that the thermionic activity rises until 
about 27 monolayers are deposited, then there is no 
significant change up to the maximum aniount of 51 
monolayers. At this maximum activity the emission 
was 0.4 A/cm? at 1000°K with a work function of 
about 1.3 ev. The type II diffraction pattern was 
found in each case. 








pa 


ga 


pe 
fo 


at 
ty 





ns 


til 
no 
ion 


of 
yas 





THERMIONIC EMISSION AND ELECTRON DIFFRACTION 959 


After this thick coating had been deposited, an 
attempt was made to remove it by heat treatment and 
to study the subsequent decrease in thermionic activity. 
These results are summarized in Fig. 6. Curve \ was 
obtained after heating the sample at 1170°K for 20 
minutes. Curve » which shows a tenfold decrease in 
activity below curve \ was obtained after heating 180 
minutes at 1170°K and 80 minutes at 1270°K. From 
Richardson plots taken at intervals during this treat- 
ment it was evident that further heating at 1170°K 
had little effect. Curve ¢ represents the lowest state of 
activity obtained in this sequence, with an emission of 
5X10 amp/cm? at 1000°K and a work function of 
2.6 ev. Further reduction of activity was not attempted, 
since the type I diffraction pattern characteristic of 
the nickel substrate had replaced the type IV diffuse 
pattern. Table I summarizes the results obtained in 
these two experiments on deposition and re-evaporation. 
Listed are the amount of deposition, characteristic 
emission at 1000°K, work function, subsequent heat 
treatments of the sample, and types of diffraction 
patterns observed. 

Similar observations made in four diffraction tubes 
gave the following results: 


(1) Evaporated BaO films remained stable over 
periods of weeks with respect to their thermionic 
emission characteristics and their physical structure. 

(2) The deposited BaO assumed different physical 
forms depending upon the temperature of the receiver 
at the time of deposit. An amorphous state, giving a 
type IV pattern, was produced when the receiver was 
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Fic. 5. Richardson plots of thermionic emission from successive 
depositions of BaO, 1 to 51 monolayers. Type of diffraction 
pattern observed shown at the top. 
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Fic. 6. Richardson plots of thermionic emission from deposited 
BaO layer accompanying removal by re-evaporation. Type of 
diffraction pattern observed shown at the top. 


below 450°K. Crystalline aggregates, giving a type II 
pattern, were formed at 800°K. The amorphous state 
changed to the crystalline state when the sample was 
held at 800°K for a short time. 

(3) Preferred orientation in the diffraction patterns 
of a few monolayers deposited at 800°K gave evidence 
that the (1,0,0) planes of the crystallites coincide with 
the surface of the underlying substrate. 

(4) Maximum thermionic emission was reached at 
about 20 monolayers of BaO with little further change 
up to 50 monolayers. This emission level is comparable 
with that obtainable from sprayed oxide coatings. 

(5) Subsequent heating of the crystalline films to 
1070°K or above caused them to become amorphous. 
This amorphous form persisted as the oxide was re- 
moved by evaporation and was accompanied by a de- 
creasing thermionic emission. 

(6) Continued re-evaporation of BaO from the 
sample reduced the emission to a common minimum 
level which was six orders of magnitude above the 
emission obtainable from clean nickel at 1000°K. 

(7) It was possible to observe the same thermionic 
emission characteristics, work function, and character- 
istic emission at 1000°K, from samples in two different 
physical states as judged by the electron diffraction 
patterns. For example, a thin crystalline deposit and an 
amorphous film of unknown thickness which was 
formed by heat treatment of a thick deposit gave 
almost identical Richardson plots. Thus, a given 
physical state of the oxide is not uniquely defined by 
its thermionic emission characteristics. 
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TABLE I. 
Characteristic 
Work emission Total 
State of function (A/cm? at Monolayers Diffraction heat 
activation (ev) 1000°K) deposited pattern treatment 
Cc 2.66 7.0X 107° 0 I 
A 1.90 1.810"? 1 II 
S 1.81 6.010"? 3 II 
Z 1.66 2.0X10"! 6 II 
a 1.62 2.0X 107 9 II 
B 1.53 4.0107! 12 II 
oY 1.42 4.0X10"! 15 II 
65 1.39 40X10" 21 II 
€ 1.27 4.0X10" 27 II 
t 1.37 4.0107! 51 II 
1.53 1.610! ILIV 10 min 1170°K 
m 1.82 2.0X10-? IV ; 180 min 1170°K 
80 min 1270°K 
re) 2.58 5.0X 10 I 180 min 1170°K 


140 min 1270°K 








INTERPRETATION OF RESULTS 


Several features of the observed changes in thermionic 
emission and physical structure of BaO films accom- 
panying deposition and heat treatment were unexpected 
and require interpretation. - 

Films laid down on a receiver at or below 450°K 
showed an amorphous structure but became crystalline 
when heated to 800°K whereas films laid down on a 
receiver at 800°K were crystalline at room tempera- 
ture. These observations seem to indicate that the 
surface and bulk mobility of BaO is so low at 450°K 
that the molecules cannot orient themselves properly 
to build up the crystal lattice. Thus they have a random 
arrangement in the film. At 800°K the diffusion rate 
has increased sufficiently to permit the rearrangement 
of molecules into a crystal lattice, which form repre- 
sents the state of lowest free energy at that tempera- 
ture. This argument may also be used to explain the 
return to the amorphous state in samples heated to 
1070°K and above. It is possible that at elevated 
temperatures the free energy is minimized by increasing 
the disorder (entropy) as would happen in a change 
from the crystalline to the amorphous form, but at 
best this is only a speculation. Since all diffraction 
patterns were taken with the sample at near room 
temperature, it must be concluded that the physical 
state of the system was “frozen in” as the sample was 
cooled from the elevated temperatures. 

The observation that maximum emission is obtained 
only when some 20 monolayers of BaO have been 
deposited is unexpected although not in disagreement 
with the results of Moore and Allison. However, 
metallic monolayers yield optimum emission for a 
surface coverage of from 0.67 to 1.0 monolayers.° 
Since a type II electron diffraction pattern was ob- 
served from a deposition of one monolayer of BaO, it 
may be reasonably assumed that this material existed 
in clumps or patches. Within each clump there must 


*I. Langmuir and J. B. Taylor, Phys. Rev. 44, 423 (1933). 





be a sufficient number of molecules to comprise a 
fairly well-ordered crystal lattice to produce the ob- 
served diffraction. The thermionic emission current 
density Jo which was measured with a low applied 
field from this patchy surface should be considerably 
less than the algebraic sum of the true zero field current 
densities from each patch. This reduced emission from 
the oxide clumps is a result of the complex electric 
field established by the surrounding areas of high work 
function.” The observed current density from this 
patchy monolayer was found to be 1.8X10-* A/cm? at 


’ 1000°K, only five percent of the emission that could be 


obtained from films of greater thickness. As the amount 
of deposited material increases, the fraction of the 
surface covered by these emitting patches likewise in- 
creases until complete coverage is obtained at about 
20 deposited monolayers. Further depositions cause 
no further increase in emission above this level. 

The observation that continued re-evaporation of 
BaO from the sample reduced the emission to a com- 
mon minimum level, six orders of magnitude above the 
emission from clean nickel at 1000°K, must be inter- 
preted in terms of a residue of BaO or some BaO— Ni 
compound. Electron diffraction patterns of the surface 
preclude the existence of these materials there. It 
seems likely that some of the BaO may have migrated 
into the crevasses of the grain boundaries which were 
clearly seen in photomicrographs of the surface. 
Van der Waal’s forces" would be large in these regions 
and tend to prevent migration to the surface even at 
temperatures which would cause rapid evaporation 
from the surface. A slow migration from these inter- 
stices to the surface could produce this constant level 
of emission which is observed over a long period of 
time. 

Thin evaporated films of BaO on Ni give a character- 
istic emission at 1000°K of about 0.4A/cm? with a 

1” R. R. Brattain and J. A. Becker, Phys. Rev. 43, 428 (1933). 


J. H. de Boer, Electron Emission and Adsorption Phenomena 
(MacMillan Company, New York, 1935), Chapter 2. 
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work function of between 1.2 and 1.3 ev, both values 
comparing favorably with the reported* emission 
values for thick, sprayed oxide cathodes. It is now 
worth considering whether the emission mechanisms 
operating in these two physically different structures 
are related. We may also consider whether the emission 
from a thin BaO film on the cathode base metal plays 
any essential role in the normal operation of thick, 
sprayed oxide cathodes. 

If the conduction of electrons from the base metal to 
the external vacuum surface in sprayed cathodes takes 
place by means of crystal conduction™:" and is followed 
by thermionic emission from the vacuum surface, it is 
difficult to assign any essentiai role to a thin oxide film 
present at the base metal-coating interface. However, 
the similarity in emission from a film composed of 
crystallites 20 monolayers thick and from the micron- 
size crystals which comprise the surface of sprayed 
oxide coatings is not too surprising. These small 
crystallites have a fairly well-developed crystal struc- 
ture, as evidenced by the electron diffraction, and thus 
may be expected to exhibit electronic properties similar 
to the larger crystals. 

If, on the other hand, at normal operating tempera- 
tures an electron gas within the pores" of the sprayed 
coating conducts electrons through the coating, an 
emission mechanism at the base metal-coating inter- 
face is desired. A thin film of the oxide on the base 
metal could supply the necessary emission to provide 
an adequate electron gas density. This density would be 





~ ® Hannay, MacNair, and White, J. Appl. Phys. 20, 669 (1949). 
83D. A. Wright, Phys. Rev. 82, 574 (1951). 
‘RR. Loosjes and H. J. Vink, Philips Research Repts. 4, 449 
(1949). 


maintained in the pores by absorption and re-emission 
from the pore walls. Since the density of the electron 
gas at the vacuum surface of the cathode is primarily a 
function of the absorption and re-emission from the 


‘oxide crystals in that region, the thin oxide film at the 


base metal serves only to maintain an adequate density 
of the electron gas at the interface. It is difficult to be- 
lieve that during cathode processing at least a few 
monolayers of the oxide have not been evaporated 
onto the base metal from the adjacent oxide crystals. 
This pore conduction hypothesis has been given added 
impetus by the recent work of Hensley'® and Young.'® 
Although in this cathode model the oxide film at the 
base metal serves an important role, the emission 
characteristics of the cathode are determined by the 
oxide near the surface. Therefore, the similarity in 
cathode emission with that of a thin film must be 
interpreted as in the previous paragraph. 

This paper summarizes the results of a study of the 
thermionic emitting surface BaO on pure nickel. 
Somewhat similar results were obtained when BaO 
films were evaporated onto a 4.7-percent W—Ni alloy. 
In general, the results were less reproducible with the 
alloy base metal, but the levels of attainable emission 
were the same as on the pure nickel. No evidence of an 
interface compound was found from the diffraction 
patterns. It is anticipated that the sealed-off, glass 
electron diffraction tube developed in this study will 
find application in the examination of other surfaces 
which must be handled under very high vacuum 
conditions. 


1° E. B. Hensley, J. Appl. Phys. 23, 1122 (1952). 
16 J. R. Young, J. Appl. Phys. 23, 1129 (1952). 
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The nature of the permanent damage retained in metals from irradiation has been investigated in some- 
what greater detail than has been done in the past. The usual assumption has been that the damage in all 
metals consists chiefly of interstitial-vacancy pairs. The model presented in this paper reduces to this picture 
for the light elements but introduces a new concept in the case of damage in the heavy metals, called a 
displacement spike. Calculations are made from which one can estimate the relationship between the density 
of interstitial-vacancy pairs and the temperature of the associated thermal spike. An assumption regarding 
the extent to which interstitial-vacancy pairs persist throughout the duration of the thermal spike has been 
made, based upon these calculations. The number of interstitial-vacancy pairs predicted in the heavy ele- 
ments is considerably smaller than that predicted by the former model. A mechanism is proposed by which 
small dislocation loops can be produced in the heavier metals by irradiation. 

This article is based upon studies conducted for the U. S. Atomic Energy Commission under Contract 


AT-11-1-GEN-8. 


INTRODUCTION 


HARGED particle or neutron irradiation is known 
to produce lattice changes in metals which can be 
retained as permanent damage as long as the metals are 


held at sufficiently low temperature. The nature of 
these lattice distortions has been the subject of con- 
siderable theoretical study, in both the open and classi- 
fied literature, by such workers as F. Seitz, H. Brooks, 
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J. D. Ozeroff, H. M. James, F. W. Brown, and M. M. 
Mills. Seitz has described the development and present 
status of the theory concerning such radiation induced 
lattice imperfections in solids.' For metals, according 
to the presently accepted picture, the damage is com- 
prised of two aspects (1) Frenkel defects, or interstitial- 
vacancy pairs and (2) effects resulting from thermal 
spikes. 

The assumption has frequently been made! that 
essentially every atom which receives an energy greater 
than a certain threshold necessary to displace it from 
its lattice site will persist as a permanent interstitial 
atom, while its lattice site will remain as a vacancy. 
The number of Frenkel defects which should be pro- 
duced in metals by irradiation has been calculated by 
Seitz on the basis of this assumption.? Thermal spikes 
have been assumed capable of producing effects of 
similar nature to those resulting from heating and 
rapidly quenching the metal. The disordering of ordered 
AuCu; by neutron irradiation observed by Siegel* has 
been cited as an example of such an effect.'* The 
Frenkel defects produced by a given primary knock-on 
atom will all lie within the limits of the associated 
thermal spike. The above assumption regarding the per- 
sistence of the radiation induced interstitials and 
vacancies therefore must include the more basic assump- 
tion that the time duration of the thermal spike is too 
short to permit appreciable annealing of the Frenkel 
defects. 

The present paper represents an attempt by the 
author to construct a model of radiation damage in 
metals based upon somewhat different assumptions. 
While a detailed theoretical investigation of these 
assumptions, as well as those made by other workers, 
would be desirable, the author can at present only give 
qualitative arguments for them and appeal to experi- 
mental work to differentiate between the two models. 





THERMAL SPIKE 


# === INTERSTITIAN ATOMS 

© -~-VACANT LATTICE SITES 
— - --PAaTh OF PRIMARY KNOCK-ON 
oces ---PATH OF SUBSEQUENT KNOCK-On 


INTERSECTIONS OF BACKGROUND LINES 
REPRESENT NORMA LATTICE SITES 


Fic. 1. Schematic representation of radiation damage model 
in two-dimensional square lattice. 


1F. Seitz, Phys. Today 5, No. 6, 6 (1952). 
2 F. Seitz, Discussions Faraday Soc. 5, 271 (1949). 
3S. Siegel, Phys. Rev. 75, 1823 (1949). 
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RADIATION DAMAGE MODEL 


In this section, the atomistic picture of radiation 
damage which will be developed later is described in a 
qualitative manner along with the assumptions upon 
which it is based. These are contrasted with the assump- 
tions made by others, and the resulting models are 
compared. 

The present paper deals only with the damage pro- 
duced in a metal by knock-on atoms and the dependence 
of this damage on the energy of the knock-ons. This 
treatment is therefore independent of the type of 
bombarding particle used. To apply the general 
concepts developed in the present paper to any specific 
type of irradiation, one must calculate the energy 
spectrum of the primary knock-ons resulting from such 
irradiation. By this means, one should obtain a more 
accurate picture of the total damage than by simply 
assuming that each primary knock-on possesses a cer- 
tain average energy as has usually been done in the 
past,?* because the nature as well as the amount of 
damage is found to vary with this energy. 

Concerning the damage from a knock-on atom of 
very high energy, the picture is represented in Fig. 1. 
Here the metal crystal is represented schematically in 
two dimensions as a square lattice, the intersections of 
the background lines representing the lattice sites. 
Interstitial atoms and vacancies are represented as 
indicated, as are the paths of the primary knock-on 
and subsequent secondaries, tertiaries, etc. 

In the present paper (1) the persistence of radiation 
induced interstitials and vacancies and (2) the pro- 
duction of appreciable atomic interchange by thermal 
spikes are assumed to be two mutually exclusive effects 
of radiation damage. This is in contradistinction to the 
usual assumption,' that the interstitials and vacancies 
persist through the duration of the thermal spikes, even 
though the spikes may disorder an ordered alloy. The 
assumption that these two effects are mutually ex- 
clusive leads to two separate regions along the path 
of a high-energy primary knock-on, each retaining a 
different form of damage, as shown in Fig. 1. The high- 
energy region to the left of point A will retain as inter- 
stitial-vacancy pairs all of the displaced atoms produced 
here, and there will be no appreciable atomic inter- 
change among the remainder of the atoms. On the 
other hand, the low-energy region to the right will 
retain essentially none of the interstitial-vacancy pairs 
produced, but the normal atoms will not retain their 
respective sites. Thus, if this region were initially part 
of an ordered superlattice alloy, these atoms will 
become disordered. 

The author was led to make the above assumption by 
the results of the calculations of the next section. It is 
found that, for many metals, the thermal spikes reach 
temperatures well above the normal melting point of 


the material. Seitz! has estimated that the spikes reach ~ 


‘J. D. Ozeroff, KAPL-205, 1949. 
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temperatures of the order of 10‘ °K for periods of the 
order of 10-" sec. The present calculations indicate 
that, in regions which reach temperatures above the 
normal melting point of the material, the concentration 
of interstitial atoms and vacant lattice sites is at least 
several percent of the concentration of normal atoms 
and lattice sites. Thus, the average separation between 
interstitials and vacancies in such regions should be 
only two or three interatomic distances or less. This 
should give rise to large local strains in the material. 

If a true melting point exists for the material which 
is heated to high temperatures by a thermal spike, it 
will be at a somewhat higher temperature than the 
normal melting point at atmospheric pressure, because 
the material will be held under high pressure by the 
surrounding undisturbed lattice. However, the tem- 
perature of the melting point should not be changed by 
more than about a factor of 2. Thus temperatures of 
the order of 10‘ °K are still large compared with the 
melting point, implying that it is not correct to think 
of the material as a solid while it is at high temperature. 
Because of the high pressures and temperatures, it is 
uncertain whether this material can be more appro- 
priately referred to as a liquid or as a dense gas. In 
subsequent discussion, the words “melting” and 
“liquid” will be used to describe this material, but it 
should be remembered that their meanings are not neces- 
sarily conventional. 

The large local strains in the material associated with 
the production of interstitial-vacancy pairs cannot 
persist indefinitely in the liquid, since the lattice sites 
surrounding an interstitial atom in a solid no longer 
exist. Upon melting, an atom, which in the solid was 
an interstitial, is no longer defined as an interstitial. 
The region including a few atoms around the location 
of this atom will contain an abnormally high concen- 
tration of atoms, due to the presence of the “extra” 
atom. Likewise, the region around a vacancy will, after 
melting, have an abnormally low concentration of 
atoms. Thus, immediately after melting, the former 
interstitials and vacancies will be considered to be 
replaced by local ‘density fluctuations.” If the quench- 
ing is extremely rapid, it is expected that these density 
fluctuations will give rise to an equal number of inter- 
stitials and vacancies again upon resolidification. If the 
liquid state is maintained, these density fluctuations 
should relax as a result of the associated local strains. 
If this relaxation time is longer than the time the 
material remains melted, the radiation-induced, inter- 
stitial-vacancy pairs will persist during the melting and 
resolidification, while if it is much shorter, they will not. 
The time for relaxation of these strains has been 
estimated by the author to be of the order of 10” sec, 
as the frequency of oscillation of any atom should be 
of the order of 10" sec~!, and a chain of two or three 
atoms must participate in the relaxation of a given 
“density gradient.” As this is appreciably shorter than 
the time the spike remains melted, the interstitials and 











TABLE I. 

Z Element = ro(A) Ew(ev) Z Element ro(A) E:r(ev) 

11 Na 3.708 180 56 Ba 4.34 860 
12 Mg 3.190 550 57 Ta 3.73 4200 
13 Al 2.856 1200 58 Ce 3.64 5900 
19 K 4.618 140 59 Pr 3.633 6000 
20 Ca 3.93 420 60 Nd 3.62 6500 
21 Sc 3.205 2500 63 Eu 3.960 1500 
22 Ti 2.91 5000 64 Gd 3.554 8300 
23 V 2.627 9600 65 Tb 3.508 10 000 
24 Cr 2.493 15 000 66 Dy 3.499 10 000 
26 Fe 2.476 20 000 67 Ho 3.480 11 000 
27 Co 2.501 20 000 68 Er 3.459 12 000 
28 Ni 2.486 23 000 69 Tm 3.446 13 000 
29 Cu 2.551 23 000 70 Yb 3.866 3700 
30 Zn 2.659 19 000 71 Tu 3.439 13 000 
37 Rb 4.87 150 72 Hf 3.14 33 000 
38 Sr 4.30 610 73 Ta 2.854 73 000 
39 : 3.59 3000 74 WwW 2.734 110000 
40 Zr 3.16 9000 75 Re 2.734 105000 
41 Cb 2.853 25 000 76 Os 2.670 150000 
42 Mo 2.720 36 000 77 Ir 2.709 120000 
44 Ru 2.644 51 000 78 Pt 2.769 110000 
45 Rh 2.685 47 000 79 Au 2.878 80 000 
46 Pd 2.745 43 000 81 Te 3.401 16 000 
47 Ag 2.882 31 000 82 Pb 3.493 14 000 
48 Cd 2.972 21 C00 90 Th 3.59 9000 








vacancies are assumed not to persist in a region which 
has been heated above the melting temperature. 

From considerations of ordinary diffusion data, it 
seems that atomic interchange should not occur appre- 
ciably during the short existence of the thermal spike 
unless the temperature is well above the melting tem- 
perature. This leads to the assumption cited earlier 
regarding two mutually exclusive types of damage. The 
transition energy possessed by the primary knock-on at 
point A in Fig. 1, at which the rate of energy loss 
becomes large enough to anneal the interstitial-vacancy 
pairs and produce appreciable atomic interchange, has 
been calculated for most metals, the values being given 
in Table I. 

The region to the right of point A is assumed to have 
undergone melting and resolidification. It is believed 
that the atomic interchange occurs during the time the 
material is melted, predominantly as a result of a certain 
amount of random motion of the atoms, possibly re- 
sembling turbulence in the flow of a liquid, initiated by 
the relaxation of the local strains when the density 
fluctuations relax. The stored energy released upon 
relaxation of these strains will be sufficient to raise the 
temperature even higher, thereby maintaining the liquid 
state for a brief period after most of the density fluc- 
tuations have disappeared. During this period, the 
turbulent motion initiated by the relaxation of the 
strains can presumably continue to a sufficient degree 
that, upon resolidification, most of the atoms will 
occupy new lattice sites. Thus, essentially all of the 
atoms in this region will be “displaced atoms,” in the 
sense that each will be displaced to a new lattice site. 
A region of crystal which has undergone melting and 
resolidification in such a manner will therefore be 
called a “displacement spike.” 
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The resolidification of a displacement spike should 
occur predominantly on the parent lattice, as it will 
form an ideal nucleus for crystallization. Thus, the 
crystal structure of the material should not be destroyed 
by displacement spikes. A few small microcrystals of 
entirely new orientation may possibly be formed, 
however, somewhat as indicated schematically in Fig. 1. 

The displacement spikes proposed here should not 
be confused with the thermal spikes described by Seitz,' 
as they differ in the following aspects. By their defini- 
tion, displacement spikes cannot sustain radiation- 
induced, interstitial-vacancy pairs, while it has been 
assumed that thermal spikes would not anneal the 
associated interstitial-vacancy pairs. The entire volume 
of a displacement spike is required to have been melted 
and resolidified, thus giving it well-defined boundaries, 
while this is not true for a thermal spike. 

Interstitial-vacancy pairs are still produced, accord- 
ing to the present model, in the region to the left of 
point A in Fig. 1, along with an associated thermal spike. 
This part of the presently proposed picture is therefore 
very similar to the model described by Seitz,' with the 
exception that the thermal spike does not reach a tem- 
perature at which atomic interchange can occur. The 
main difference between the model proposed here and 
the former one, therefore, is the displacement spike 
concept. 


CALCULATIONS 


It is desired to calculate the density of interstitial- 
vacancy pairs produced in the region of the thermal 
spike along the path of a primary knock-on atom. A 
“displacement collision’ will be defined as a collision 
of the primary knock-on with a normal lattice atom in 
which sufficient energy is transferred to the normal 
atom to separate it from its lattice site, creating an 
interstitial-vacancy pair. One must then calculate the 
mean free path of the primary knock-on between suc- 
cessive displacement collisions. 

Basic to a calculation of the mean-free path between 
displacement collisions is a calculation of the cross 
section for scatter of an atom by an identical atom 
with an energy transfer greater than a given minimum 
€o. This in turn involves a knowledge of the interaction 
potential energy between two identical atoms V(r). 
This will be calculated, assuming that the potential of 
each atom is essentially a screened Coulomb potential 
of the form 


o(r) = (Zq/r)e-""", (1) 


where g represents the charge of a proton and Z is the 
atomic number of the atom. From Schiff,® one finds 
that, for moderately heavy atoms, the radius a of the 
atomic electron cloud is of the order of magnitude 


a~ (h?/m@eZ"'), 
or 


a=CaoZ'*, (2) 
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where C is a proportionality factor of the order of unity 
and 
ao= h?/mg?=0.5282X 10 cm 


is the radius of the first Bohr orbit of hydrogen m 
representing the electronic mass. 

Ozeroff* has made a calculation of V(r) based on the 
same model, but his result is apparently in error. The 
detailed calculations are therefore presented in Ap- 
pendix A, giving the expression 


V(r) = (Z7q?/r)e—"!*(1—r/2a). (3) 


This approaches the expected, Coulomb repulsion as r 
approaches 0. At r=2a, it changes sign, becoming a 
weak attractive potential at large distances, with a 
minimum at r=a(1+v3). This is consistent with the 
model used, in which an atom consists of a nucleus sur- 
rounded by the rigid charge distribution p, given in 
Appendix A. It may not be consistent with the true 
physical picture, however, because effects such as the 
redistribution of charge during a collision and exchange 
interactions have been neglected. The closed shell re- 
pulsion between ions is probably the largest effect 
neglected. Certainly, for r somewhat less than 2a, the 
above expression is expected to be approximately 
correct, as the shielded Coulomb repulsion is expected 
to dominate at short distances. 

The potential function, Eq. (1), which has been used, 
is best suited for heavy un-ionized atoms. For the 
lighter metals, the quantitative results of such a poten- 
tial may be inaccurate, because the manner in which 
the screened Coulomb field drops off does not approx- 
imate the exponential form as closely as in the case of 
the heavier elements. However, the range and strength 
of the interaction should still be correct as to order of 
magnitude; and, therefore, it should be possible to 
draw good qualitative conclusions by use of it, which 
should be of the correct order of magnitude, even for 
the very light metals, as long as they are not ionized. 

A good approximation to the upper limit of kinetic 
energy at which a moving atom can be considered 
un-ionized is 

E=(M/m)E ion, 


where M is the mass of the atom, m is the electronic 
mass, and Ej,, is the ionization energy of the atom. If 
it is assumed that an ionization energy of 5 ev is suf- 
ficient to ionize any atom to a point such that the above 
expression for V(r) no longer gives good results, then 
the following table gives the kinetic energies below 
which the present treatment is valid. 


Atom Energy, ev 
Be® 80 000 
Al” 250 000 
Cu® 570 000 


In most types of irradiation, the energy transferred 
to the primary knock-on will be low enough so that the 
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complete history of the primary knock-on and all suc- 
cessive knock-ons can be treated by use of Eq. (3). 

The accuracy of this potential is justified for atomic 
separations less than about 1.5a. At distances between 
about 5a and 10a, the closed shell repulsion is expected 
to be the dominant interaction. Thus, in this range, 
the potential is assumed to be an exponential repulsion 
of the Born-Mayer type,® Ae~®', with constants ad- 
justed to fit observed compressibility data.’ In the 
interval 1.5a<r<5a, the potential will be left unspeci- 
fied; the assumption will be made, however, that it is 
a monotonically decreasing function of r in this range 
connecting smoothly with the assumed expressions at 
both ends. 

Having arrived at a potential function, a satisfactory 
method for calculating the scattering cross section must 
now be chosen. The general condition which must be 
satisfied in order that either the Born approximation or 
the classical treatment may be valid is 


<a, (4) 
where 
k=h/Mv 


is the de Broglie wavelength of the moving particle and 
a represents the order of the dimensions of the scattering 
field. This condition is fulfilled for atoms with energies 
greater than 100 ev when the scattering field is given 
by Eq. (3). Hence, in all collisions involved in the 
present problem, either the Born approximation or the 
classical treatment, or a combination of both, can be 
used according to the conditions described by Williams.® 
The classical treatment can be applied when 


Vb/ho>1, (5) 


where b represents the impact parameter in collisions 
in which the energy transfer is small compared with the 
energy of the moving particle; V represents the strength 
of the potential evaluated at this distance, and v is the 
velocity of the moving particle. In potentials and 
regions of potentials where the opposite condition is 
fulfilled, the Born approximation can be applied. 

Collisions of interest in the present problem involve 
energy transfers of between 1 and 25 ev, where the 
energy of the moving particle is between 1000 and 
100 000 ev. Thus, 6 in Eq. (5) can be considered as 
truly representing the impact parameter. The rela- 
tionship between 6 and « will be determined later, and 
it will then be shown that Eq. (5) is fulfilled for values 
of €9 between 1 and 25 ev as long as Z>10. The clas- 
sical approximation will therefore be used, and it must 
be remembered that the results will only be good for 
metals whose atomic number exceeds 10. 


5L. I. Schiff, Quantum Mechanics (McGraw-Hill Book Com- 
pany, Inc., New York, 1949), p. 168. 

6M. Born and J. E. Mayer, Z. Physik 75, 1 (1932). 

7P. W. Bridgman, The Physics of High Pressure (G. Bell and 
Sons, Ltd., London, 1949), p. 160. 

8 E. J. Williams, Revs. Modern Phys. 17, 217 (1945). 


In Appendix B, the classical treatment of the problem 
is carried through, assuming that Eq. (3) represents 
the correct potential over the entire range of r. On this 
assumption, the expression for the energy transfer is 
found to be 

Z14/8 ER’ 
«= 4——_ —F(b/a), (6) 
CE 


where Ex represents the Rydberg energy 13.52 ev and 
F (x) = {Ki(x) — (%/2)Ko(x)}, (7) 


where K denotes the modified Bessel function of the 
second kind. 

The cross section for scattering with an energy trans- 
fer greater than e for such collisions will then be 


C*Ee 2 
o.=7b?= ra F-1{ ————_ ; (8) 
4E 2Z"i3 


where F—'(x) is the inverse of the function defined in 
Eq. (7). 

The function F(x) is shown as the solid line in Fig. 2. 
According to Eq. (6) this function is proportional to 
the energy transfer when the impact parameter is ax. 
It is noticed that at «=2.4, the curve drops to 0, so 
that a particle with an impact parameter equal to 
2.4a should not be deflected. The reason for this is 
apparent from the potential used, which has its mini- 
mum at 2.73a. Thus, an impact parameter of 2.4¢ 
apparently corresponds to the path which a particle 
must follow if it is acted on equally by the attractive 
and repulsive forces, giving no net deflection. 





100 





SOLID LINE --- THE FUNCTION , Fin), 
OEFINED Ww Eq iT) 


BROKEN LINE--- THE CORRECTED Fix) FUNCTION 

















Fix) 







































































108 | 

0 = a et ae ee ae 

Fic. 2. Function F(x) as defined in Eq. (7) and corrected 
F(x) function. 
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Consider now the effects of having neglected the 
closed shell repulsion. This interaction is approximated 
by a simple exponential form Ae~®*, which is just the 
negative of the form approached by Eq. (3) for r>>2a. 
The energy transfer in such a collision will be inde- 
pendent of the sign of the potential and will depend 
only on the magnitude. The magnitude of the closed 
shell repulsion has been calculated for several of the 
inert gases, and it can be estimated from experimental 
compressibility data’ for metals. In general, the evalu- 
ation of A and B from compressibility data yields 
values comparable with those of the attractive term in 
Eq. (3). Thus, the total interaction at distances greater 
than about 7a is of the same order of magnitude as 
given by Eq. (3) and is positive rather than negative. 
The energy transfer for large impact parameter should 
therefore be of the same magnitude as that calculated 
here. Thus, for large x, although the sign of the potential 
is wrong, the F(x) curve in Fig. 2 should be approxi- 
mately correct. 

At distances somewhat less than 2a, Eq. (3) is 
expected to represent the true interaction, as the 
Coulomb interactions should predominate here. The 
F(x) curve in Fig. 2 should therefore be correct up to 
about += 1.5. 

As the true potential is a monotonically decreasing 
function both at large and small distances, rather than 
a function whose slope changes sign, as is Eq. (3), there 
seems to be no further reason for supposing that it has 
a minimum in the neighborhood of 2a. It will therefore 
be assumed that V(r) is a monotonically decreasing 
function of r throughout the range 0<r<10a. This 
will then eliminate the zero in the F(x) curve and 
demand that F(x) be a monotonically decreasing func- 
tion of x. It is therefore assumed that the true F(x) 
curve can be obtained by bridging the gap smoothly, 
giving the broken line in Fig. 2. 
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Fic. 3. The mean free path of a knock-on atom between dis- 
placement collisions / as a function of its energy E in cases where 
E>e. 


® A few examples of such calculations are J. C. Slater, Phys. Rev. 
32, 349 (1928); P. Rosen, J. Chem. Phys. 18, 1182 (1950); W. E. 
Bleick and J. Mayer, J. Chem. Phys. 2, 252 (1934); M. Kunimune, 
Progr. Theoret. Phys. 5, 412 (1950). 
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It would be possible to estimate an F(x) curve for 
each metal, basing the right-hand end on the compres- 
sibility data for the particular metal. This process is 
tedious and seems unwarranted as a result of the uncer- 
tainty of other approximations which must be made. 
Rather, the corrected F(x) curve in Fig. 2 will be 
assumed as an average and will be applied to all metals. 
Therefore, throughout the remainder of the present 
paper, and in the use of Eqs. (6) and (8), F(x) will be 
taken to be defined by the broken curve in Fig. 2, 
rather than by Eq. (7) which gives the solid curve. 

For purposes of determining whether or not Eq. (5) 
is fulfilled, the form of the assumed potential at large 
distances can be taken approximately as the magnitude 
of the attractive part of Eq. (3), 

2? 2 
V(r) >—e"*". (9) 
2a 


Using this and Eq. (6), it can now be seen that Eq. (5) 
is fulfilled if e.£> 1000 (ev)? and Z>10. 
The mean free path between displacement collisions 
is given by 
l= 1/ Notes (10) 


where o,, is the cross section for a displacement col- 
lision and N» is the density of atoms. It will be assumed 
that, for both close-packed and body-centered cubic 
metals, Vo is given by 


No= 1.4/r,3, (11) 


where fro is the interatomic distance. This is accurate 
only for close-packed metals, but is less than 8 percent 
in error for the body-centered cubic structure. Equation 
(8) will be used for o,,. One can then pick any metal 
with a given Z and ro, assume a value for the displace- 
ment energy €o and plot / vs E. 

A more general curve can be obtained, however, for 
all metals by plotting the quantities P and Q, which are 
defined by the expressions 











lag 
em (12) 
Ee 
= (13) 
Then, from Eq. (8) it is seen that if one sets 
F (x) = (C?/4Ep’)Q, (14) 
the cross section for displacement is 
aC7a,? 
Co= Tex? = as at. (15) 
Therefore, combining Eqs. (10), (11), (12), and (15), 
P=1/14nC2x2. (16) 


The value of C will be taken as 2.09, as used by Ozeroff' 
to agree with the Thomas-Fermi atom model, Then, 
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Fic. 4. The mean free path of a knock-on atom between dis- 
placement. collisions / as a function of its energy E in cases where 


“~ €0. 


from Eqs. (14) and (16) and the F(x) curve, one can 
plot P vs Q, giving the curve in Fig. 3. For any given 
metal, one knows Z and ro; and, by estimating a value 
for €9, one obtains the / vs E curve for a particular 
metal from Eqs. (12) and (13) as P and Q are directly 
proportional to / and E, respectively. 

The impulse approximation was used to obtain the P 
vs Q plot in Fig. 3. It is therefore possible to set a lower 
limit on the values of Q for which this curve is valid, by 
applying the validity criterion for the mngumee approxi- 
mation, namely, 

eK. 
This gives 
OD e?/Z4', (17) 


To obtain a rough idea of the form of the / vs E curve 
for lower values of E, it will be assumed that the scat- 
tering can be treated as hard sphere scattering, the 
cross section for scattering with an energy transfer 
greater than ¢9 being given approximately by 


oeg= hy 2c 1— (€)/E) |, (18) 


where bo represents the distance of closest approach of 
the atoms in a head-on collision. 

To obtain an expression for bo, the form of the poten- 
tial at large distances will be taken as that given in 
Eq. (9). Setting this equal to E, one can solve for do, 
giving 

b=— 1 


™, Zils En R 
n(—— (19) 
VA 


Combining Eqs. (10), (11), (12), (18), and (19), it 
is found that in this ar limit, P is given by 


“al DE 


In this case, it is not possible to choose a single coor- 
dinate proportional to E which will give a single curve 
representing all metals. Thus, a separate curve must be 
plotted for each set of values for Z and ¢9. Figure 4 has 
been plotted showing the family of / vs E curves which 
one obtains by choosing €9 as 25 ev. The curves from 
Figs. 3 and 4 have been combined to give the / vs E 
curve for copper as a typical example in Fig. 5. While 
the two curves do not join exactly, the agreement 
between them is satisfactory for present purposes. 

The displacement energy €9 assumed to be 25 ev, is 
in general of the order of, or less than, 100 times the 
energy per atom necessary to melt the material within 
a displacement spike. From this fact, one can see from 
Eq. (8) and Fig. 2 that the number of displaced atoms 
in any region should be about one-fourth the number of 
atoms which are heated to the melting temperature. 
Thus, in regions in which the lattice is heated to suf- 
ficiently high temperatures that melting may occur, 
the density of displaced atoms according to this theory 
should be about 25 percent of the total atom density. 
While this figure may be somewhat in error as a result 
of the rather crude approximations made in the pre- 
ceding calculations, it seems certain that the number 
should be at least 5 percent, supporting the assumption 
that radiation induced Frenkel defects cannot persist 
in a region which has been melted and resolidified. 

A criterion must now be set up to determine whether 
or not such melted regions are produced, and what the 
transition energy E;, of the knock-on is when such 
production starts. There are three factors which con- 
tribute to the heating of a displacement spike: (1) the 
nondisplacement elastic collisions of the knock-on atoms 
with the normal atoms of the lattice; (2) the transfer of 
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Fic. 5. The mean free path of a copper knock-on atom between 
displacement collisions / as a function of its energy E. 
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energy to the lattice from the electrons which have been 
excited by the knock-on atoms; and (3) the release of en- 
ergy as a result of the annealing of the Frenkel defects. 
Calculations made by Brooks" show that the energy im- 
parted to the electrons by a knock-on atom is dispersed 
among a large number of electrons before it can produce 
an appreciable temperature rise in the atomic lattice; 
hence Process (2) is negligible. Process (3) occurs as a 
result of Process (1); therefore, the heating as a result 
of Process (1) alone must be great enough to initiate 
the action which is then continued by Process (3). It 
will therefore be assumed that the lattice heating re- 
sulting only from nondisplacement elastic collisions of 
the primary knock-on with lattice atoms must be suf- 
ficient to melt a continuous cylindrical region along its 
path, from two to four interatomic distances in diam- 
eter, in order to initiate the production of a displace- 
ment spike. Thus, the value of E;,, will be taken as the 
energy of the primary knock-on at which the rate of 
energy Joss to nondisplacement elastic collisions is just 
large enough to heat to melting temperature from 4 to 
12 lattice atoms per interatomic distance along its path. 

This criterion can be simplified somewhat by con- 
sidering the previously estimated ratio of the relative 
numbers of displaced and melted atoms. One can 
therefore estimate that the value of E,, is the energy 
of the primary knock-on at which the mean free path 
between successive displacement collisions becomes of 
the order of one interatomic distance. This will be 
taken as the criterion for the determination of £,,. It is 
therefore seen that E,, for copper will be given by the 
second intersection of the curve in Fig. 5 with the 
horizontal broken line, which represents a mean-free 
path between displacement collisions equal to one 
interatomic distance. 

Similarly, values of E,, for all metals can be obtained 
from Fig. 3 by calculating the value of P corresponding 
to one interatomic distance in the given metal, and 
determining E,, from the corresponding value of Q, 
given by the curve. Table I has been calculated by this 
method, assuming € to be 25 ev for each metal. 


THE NATURE OF DISPLACEMENT SPIKES 


From Table I, one deduces that, if the radiation 
damage model proposed in this paper is correct, the 
damage produced in the heavier metals by most of the 
common kinds of irradiation should consist primarily 
of displacement spikes. Only in a few of the lightest 
metals should the production of interstitial-vacancy 
pairs be appreciable. One likely means of experimentally 
differentiating between this model and the former one 
should be a determination of the number of interstitial- 
vacancy pairs produced in some of the heavier metals 
by pile neutron irradiation. The present model predicts 
few or none, while other workers have assumed this to 


be the primary damage resulting from such irradiation.: 


Another difference is that the former model assumes 


” H. Brooks (private communication). 





BRINKMAN 


that the damage produced in both the light and heavy 
elements is of essentially the same nature, the primary 
effect being the production of interstitial-vacancy pairs. 
The presently proposed model, however, predicts that 
the damage in the heavy elements will be of a different 
nature than in the light elements, consisting chiefly of 
displacement spikes. In this connection, some specu- 
lation on the nature of displacement spikes and on the 
differences expected to be observed in the physical 
effects resulting from displacement spikes and from 
interstitial-vacancy pairs seems appropriate. 

It is easy to imagine processes by which dislocation 
loops can be formed within a displacement spike during 
its resolidification. The formation of small microcrystals 
of new orientation, as illustrated in Fig. 1, should be a 
more difficult process and, if it occurs, should be much 
less frequent than the production of dislocation loops. 
The reason for this is that the boundary of such a small 
microcrystals should consist of an array of these dis- 
location loops. Thus it seems that small dislocation 
loops should be one of the primary products of displace- 
ment spikes. 

As a result of the small size of these dislocation loops, 
they will be in strong tension; and, as a result, they 
should anneal easily, simply by collapsing. The activa- 
tion energy for annealing should increase with the size 
of the loop. Thus, the annealing of this type of damage 
can be expected to have a variable activation energy, 
giving an annealing process which occurs over a rather 
wide range of temperature. It should be possible to 
observe annealing beginning at very low temperatures, 
but which will not run to completion until the tem- 
perature is raised considerably higher. 

This is in contrast to the annealing which one should 
observe for a process involving only a single activation 
energy. Here, if a temperature is found at which an- 
nealing will begin, the process should run to completion 
at this temperature. From measurements of the an- 
nealing of radiation damage, it may therefore be possible 
to separate the two models, because the annealing of 
interstitials and vacancies should be characterized by a 
small number of discrete activation energies rather than 
a continuous range. 

The size of displacement spikes can probably be 
estimated crudely by dividing the energy available for 
production of the spike, which is just the energy of the 
primary knock-on at the time spike production starts, 
by an average energy per atom when the spike is in 
the melted condition. The energy per atom necessary 
to melt typical metals at atmospheric pressure is 
between 0.1 ev and 0.2 ev. The metal within displace- 
ment spikes will be held at high pressure by the sur- 
rounding lattice, raising the melting point somewhat, 
to a value which should still be less than 0.5 ev. To 
account in a rough manner for heat losses to the sur- 
rounding nonmelted lattice and to the electronic system, 
the estimate will arbitrarily be raised to the order of 
1.0 ev per atom as the amount of available energy dis- 
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sipated by each atom of the spike. Using this figure, 
one obtains an average size for the displacement spikes 
produced by 2-Mev neutrons in copper of about 2X 10* 


‘atoms. If the region is assumed spherical, this represents 


a sphere about 75A in diameter. This then also repre- 
sents an approximate upper limit on the size of dis- 
location loops which can be produced. 

It was argued earlier that a density of interstitial- 
vacancy pairs giving an average vacancy-interstitial 
separation of only two or three interatomic distances 
would not persist throughout the associated thermal 
spike, because the relaxation time of the pairs was prob- 
ably short relative to the melted period of the spike. It 
is possible, however, that a lower concentration of 
interstitial-vacancy pairs could persist during the tem- 
perature pulse. In particular, one might think that an 
average vacancy-interstitial separation of 8 to 10 
interatomic distances might have a relaxation time 
somewhat longer than the time the spike remains 
melted. Relaxation of the interstitial-vacancy pairs 
therefore may be thought of as progressing until the 
concentration of atoms in interstitial positions is 
decreased to the order of 10-*, at which time resolidi- 
fication occurs, freezing in the remaining interstitial- 
vacancy pairs. Hence, it is possible that a few in- 
terstitial-vacancy pairs will be produced in each dis- 
placement spike, the maximum concentration being 
estimated to be of the order of 10~*. For pile neutron 
irradiation on copper, this corresponds to about 20 
interstitial-vacancy pairs per primary interaction, a con- 
siderably smaller number than one would obtain from 
calculations based on the earlier model, of the type 
made by Seitz.” In the lighter elements, Be, C, Na, Mg, 
and Al, the two models should predict about the same 
type of damage, as the size of the displacement spikes 
in such cases should be negligible. Attempts to experi- 
mentally differentiate between the two models should 
therefore be carried out on the heavier metals. 


CONCLUSIONS 


It would be desirable to measure a direct effect of 
irradiation on some physical property of the heavier 
metals which could be definitely assigned to either 
interstitials and vacancies or to dislocation loops. Such 
an effect, however, may be difficult to find, as both 
should increase the electrical resistivity, both are ex- 
pected to produce increases in hardness, and other 
effects are probably common to both. Two more 
indirect methods have been described in the last section 
by which the accuracy of the present model may be 
checked. These are (1) an analysis of the dependence of 
the nature of the damage on atomic number and (2) an 
analysis of the annealing of property changes produced 
by the damage. It is suggested that such experiments 
be carried out on metals irradiated with pile neutrons 
at temperatures as low as possible. The average size of 
displacement spikes produced by pile neutrons should 
be considerably larger than those produced by cyclotron 
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irradiation. Thus the production of dislocation loops 
and other displacement spike effects should be more 
pronounced in neutron irradiation. In order to retain 
as much of the damage as possible, the metals should be 
held at temperatures as low as can be maintained during 
irradiation, because, according to the present model, 
displacement spike effects may begin to anneal at tem- 
peratures below that of liquid nitrogen. 
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APPENDIX A 


Consider a system of two atoms. The calculation is 
simplified considerably by imagining the atoms to be 
nonidentical at first. The separation distance will be 
denoted by &, the first being located at coordinates R, 
and the second at Re. The respective potentials will be 


exp(— | r—R,| /a:) 








gi=Z » 
1 19 ir—R,| 
exp(— | r—R,| /az) 
oi1=Z29 ’ 
|r—R,| 
and as 
1 
ile ~— FY, 
4a 


the associated charge distributions are 


Z.qg{exp(— | r—R,| /a;) 
aia p( | , asl — 45 (rR) |, 
4a a;?|r—R,| 


Z2q {exp(— | r—Re| /a2) 
ee 


4a a*|r—R,| 








— 415°) (r—R,) ’ 


where 6° is the three-dimensional 6 function repre- 
senting the nuclear point charge. The electrostatic 
energy of the system is given by 


1 
W= -f (¢1+¢2) (pi+p2)d®r. 
2 all space 


The cross terms will give the interaction energy 


1 
v=-f 
2 


all space 


(dipo+o2p1)dr, 


The two terms in this integral must be equal as a con- 
sequence of Green’s reciprocation theorem,'' and there- 
fore 


V= piped ®r, 
all space 


1 W. R. Smythe, Static and Dynamic Electricity (McGraw-Hill 
Book Company, Inc., New York, 1950), p. 34. 
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An equivalent result should be obtained if 2p; is 
integrated instead of ¢:p2. Thus, 


V exp(— ®/a,) 1 

















Z 1229" R 4a 2 
exp(— 'r—R;,! ‘a \r—R,| ‘a») 
all space ir—R, | ir—R,| 
exp(— ®/a2) 1 
R 4ra;’ 
exp(— ir—R,| ‘a,—|r— | ‘dy 
xf — —d®y, 
all space r—R,| ir—R,| 
and therefore 
exp(— | r—R, | ‘ay— | r—R.,| ‘d») 
f oitienananeinptemennnsnerinhnan ea d®@r 
all space ir—R,| ir—R, 
exp(— ®/a:)—exp(— ®/a2) 
= dra 2a.2—_—______—____—_. 
R (a r— a") 
Thus, , 
Z,Z.q° (4° exp(— ®/a2)— a2? exp(— ®/a1) | 
V(a)=———_ ——— 
R | Ra’—a" 
for a\¥ a2. 


The atoms can now be considered identical again, 
setting 


a\>=da2=a, 2:=2:=2Z. 


V (®) then takes on an indeterminate form, which when 
evaluated becomes 


q R 
V(@)=— exp(—a/a)(1-—-). 
R ? 


<a 


APPENDIX B 


As € is generally small relative to the energy of the 
moving particle, the classical treatment can be sim- 
plified by use of the “impulse approximation,” in which 
the moving atom is considered undisturbed by the 
stationary atom. The impulse given to the stationary 
particle is given by 


+ +27 M\} 
I= f F,dt=2 f ( ) Fedx, 
» \2E 


=< 





where F,, is the component of the force exerted on the 
stationary atom by the moving atom perpendicular to 
the path of the moving atom, and is given by 


F,=F,siné, F,=—dV/dr. 


Here, @ is the angle between the line joining the two 
atoms and the path of the moving atom, and V is 
given by Eq. (3). This gives 


I 2M? 
ae (—) {9:1 + 52+ 93}, 
24 E 
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where 
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and 6 is the impact parameter. 
—2a*J, is just the Laplace transform of the function 


0 for 0<r<b 
1 
(2)! 








for b<r<a, 


which is equal to Ko(b/a), where K denotes the modified 
Bessel function of the second kind. The integrals 9, 
and g; can then be obtained by integrating 4g, with 
respect to 1/a. Thus, 


1 
I\= ——K,(b ‘a), 
Va? 


2a 
Ko(xb)dx, 


1 
$2= 653 f 
a l/a 
a= f f Ko(xb)dxdy. 


lja“ y 


30 


By use of the formula 


f ’ iy f(x)dx=— J ’ (a—x) f(x)dx. 


y 


$3 can be shown to be equal to 


on -f- [ (1/a)—x ]Ko(xb)dx. 


Thus 
Sot a= f xKo(xb)dx= (1/ab)K,(b/a), 
al l/a 
2 f2M\} 4 
? a 2. {K;(6/a)—3(b/a)Ko(b/a)}. 
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The energy transfer is 
Pr Zi4/8 ER’ 
=>—_ = 4 
2M CE 
where C is defined in Eq. (2), and 
Er=@/2a9= 13.52 ev= Rydberg energy, 
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A Capillary Viscometer with Continuously Varying Pressure Head 
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A new capillary viscometer has been designed to study the flow behavior of non-Newtonian fluids. The 
instrument is convenient, absolute, and accurate, and covers continuously a wide shearing stress range in a 
single determination. A falling mercury column forces the sample through the capillary. Measurement of 
the column height as a function of time gives both pressure drop and flow rate. Provision is also made for 
the use of driving fluids less dense than mercury for measurements at lower shearing stresses. The density 
of the sample need not be determined. Methods and illustrations are given of the application of the new 
instrument to the determination of the absolute viscosities of Newtonian liquids, and the flow curves of 


non-Newtonian fluids. 





INTRODUCTION 


APILLARY viscometers find very wide application 
for the measurement of the flow characteristics of 
fluids. In the case of Newtonian fluids the observation 
of the rate of flow at a single pressure drop across the 
capillary is sufficient to define the flow behavior. How- 
ever, in the case of non-Newtonian fluids a range of 
shearing stresses must be employed, and this is usually 
obtained by varying the pressure drop. An effective 
means of accomplishing this pressure variation was de- 
vised by Ostwald and Auerbach,! who allowed the liquid 
to drain from a buret under its own pressure through 
a capillary, while the times corresponding to various 
buret readings were recorded. From these data could 
be calculated the pressure drops and the flow rates. 
By this means pressure heads ranging from the height 
of the buret down to zero could be obtained in a single 
determination. 

Giesy and Arzoomanian,”? working with stiff pastes, 
extended the pressure range and eliminated drainage 
errors involved in the Ostwald-Auerbach viscometer by 
using mercury as a drive fluid to force the sample from 
a reservoir through a horizontal capillary. 

The viscometer described in this paper is a modifica- 
tion of the previously mentioned buret-capillary instru- 
ments in the direction of increased precision, versatility, 
and convenience of operation. Also presented below is 
a novel method of treating the data for recovery of the 
rate of shear-shearing stress relation of the fluid studied. 


DESCRIPTION AND OPERATION OF NEW 
VISCOMETER 


A sketch of the new viscometer is shown in Fig. 1. 
The apparatus, constructed entirely of glass, consists of 
a capillary unit I and a manometer unit II connected 
by a ball and socket joint E. The capillary unit is com- 
posed of a water jacket F inside of which are sealed the 
bulb C and tube D of the same diameter, joined by the 
capillary A and sidearm tube B. The function of the 


1W. Ostwald and R. Auerbach, Kolloid Z. 41, 56 (1927). 

*P. M. Giesy and S. Arzoomanian, Colloid Symposium Mono- 
% ph (Chemical Catalog Company, Inc., New York, 1928), Vol. 
/, p. 253. 


sidearm is to facilitate filling of the unit with sample, 
and to speed attainment of hydrostatic balance by by- 


passing the capillary. During a run the sidearm stop- 
cock is kept closed. 


The manometer unit II consists of two precision-bore 
tubes, ca 0.34 cm in diameter and 100 cm long, a bulb 
WV, and a 20-ml hypodermic syringe joined to the mani- 
fold as indicated. Tube m is used for mercury, while 
tube w is for less dense driving fluids. The manometer 
tubes are placed against a scale of millimeter graph 
paper, protected with cellulose acetate film, to permit 
reading of the column heights. The syringe is used to 
adjust the amount of mercury in the system and the 
mercury levels. All the stopcocks shown are large bore 
and fitted with pressure adapters. As may be seen from 
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Fic. 1. Diagram of new viscometer. 
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the drawing, only mercury comes in contact with the 
sample even when a different driving fluid is employed. 

To permit variation of capillary size, four units such 
as I were constructed. These differed only in the bore 
of the capillaries sealed in, and they could readily be 
interchanged through the ball and socket connection £. 

To operate the instrument, the units are assembled 
and held in place by means of a clamp applied at the 
ball joint, and the stopcocks and syringe are manipu- 
lated to raise the mercury in unit I to the top of bulb C. 
Approximately 20 ml of sample are then introduced 
through the top of unit I, and drawn into bulb C and 
the sidearm by retraction of the syringe. The syringe is 
then advanced to force out any air present and to fill 
the capillary with sample, the levels of mercury and 
sample are adjusted to a point such as shown in Fig. 1, 
and stopcocks a and 6 are closed. Finally, pressure is 
applied to the syringe to fill the appropriate manometer 
column with fluid, and stopcock d is closed. At this 
stage of the operation only stopcock 6 prevents the 
driving fluid from pushing the sample through the 
capillary. 

After the sample has been brought to equilibrium 
with the water circulating through F from a thermostat, 
stopcock 0 is opened, and the times / are noted at which 
the drive fluid level passes various heights z in the 
manometer column. For this purpose a stopwatch with 
a “split second’”’ following hand greatly facilitates the 
timing. When the desired range of pressures has been 
covered, stopcock a is opened, and the height 2) to 
which the drive fluid falls at hydrostatic balance is read. 
An accurate determination of 2» is necessary since this 
figure enters into the calculation of the effective driving 
height. With stopcock a closed, 2» is approached asymp- 
totically with time, and a long period elapses before 
attainment of hydrostatic balance. With the aid of the 
sidearm, however, this balance is reached rapidly, and 
z) can be read without waiting. 


The sample may at this stage be returned to the lower 


bulb, and the determination repeated as many times as 
desired. 

At the end of an experiment the syringe is used to 
draw mercury from the sample bulb to just above stop- 
cock 6, stopcocks 6 and ¢ are closed, and the capillary 
unit removed for cleaning. By this procedure only 2-3 
ml of mercury are contaminated per determination. 
The capillary units are readily cleaned with either hot 
nitric acid or dichromate-sulfuric acid cleaning solution. 


PRINCIPLE OF OPERATION 


Consider first the operation of the instrument with 
mercury as the drive fluid. Let r,, be the radius of 
manometer tube m, and let A be a capillary of radius 
R and length L. Also, let the radius of bulbs C and D, 
which are of the same diameter, be Rc. Since bulbs 
C and D have the same diameter, the height s of the 
sample column, Fig. 1, remains constant during a deter- 
mination. 
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In Fig. 1 let x and z be the heights of the mercury 
columns in the U-tube system above a horizontal base 
line as indicated. When the sample is flowing through 
the capillary tube with a volume rate of flow Q, the 
pressure drop P across the capillary is 


P= 2p mg — (XpmZ+Sp.8) (1) 


where p,, and p, are, respectively, the densities of mer- 
cury and sample, while g is the acceleration due to 
gravity. When the system is in hydrostatic balance 
x=xXy and z=2»), and thus 


O= 20pmg— (XoPmg+Sps8). (2) 
Subtracting Eq. (2) from Eq. (1), we get 
P=[(s—20) — (x—x0) long. (3) 


Since the volume of mercury in the system remains 
constant, 





Tn 
X— Xy= ——(s—20) (4) 
C 
and hence Eq. (3) becomes 
Tm y 
P= (s—seng( 1+~ ). (5) 
c 
Letting now h=z—20, 
ae 
P=hong( 1+.) (6) 
Re 
or 
P=a,h (7) 
where 
Tn 
c= pug ( 1+ ): (8) 
Re 
The volume rate of flow is given by 
dh 
Q= ar, (9) 
dt 


From Eqs. (6) and (9) it is evident that determina- 
tion of / as a function of /, as the sample flows through 
the capillary under the driving pressure of the manom- 
eter, allows calculation of both P and Q. It should also 
be observed that the density of the sample p, does not 
enter the equations, and hence need not be known. 

When an alternate driving fluid of density p, is used 
in manometer w of radius r,,, the pressure drop is 


P= (2—Y) puS+ Yom — Spx — XPmf- (10) 
At hydrostatic balance, P=0, and hence 
O= (Z0— Yo) Pw&+ VoPmZ — SPs¥— XoPmg- (11) 


Where yo is the value of y, the height of mercury in 
bulb W, at hydrostatic equilibrium. Subtracting Eq. 
(11) from Eq. (10), we obtain 


(12) 


P= (s— 20) PwE+ (y— yo) (Pm— Pw)g— (x— 2X0) Pmg.- 
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Setting again z—2z)=/, and noting that 


























Vs ee 
¥—Yo= (2—20)— =h—, (13) 
Ry? Rw 
where Ry is the radius of the bulb W. Thus, 
Fes Tuo? 
_ (x— xo) = (s— 29) —=h ’ (14) 
Re Re 
Eq. (12) leads to 
| a Pm Rw 
P= peg 1+ |?"( 14 )-1 |, as) 
Rw’Low Re 
or 
P=a,h, (16) 
where 
Tat Pm Ry’ 
ccomalir2f(142%)-s]- an 
Ry’ Pw Re’ 
When Rw= Rc, this equation simplifies to 
Pa 2pm 
woofs 2( i) 
Ry’ Pw 


and this is the value of a, to be used in Eq. (16). As 
before the rate of flow is given by 


' dh ' 

Q=—nfr,,"—- (19 
dt 

From Eqs. (16) and (19) it may be seen that again Q 

and P follow from measurements of / and /, and that 

again the density of the sample is not needed. 


FLOW OF NEWTONIAN FLUIDS IN VISCOMETER 


According to Poiseuille’s equation the viscosity 7 of 
a Newtonian fluid is given by 








wR'P 
n= ' (20) 
8LQ 
Substituting the relations for P and Q deduced above, 
we get 
Rta 1 
eee (21) 
8Lr* d\nh 
dt 
or 
d logiok Rta 1 B 





dl 8(2.303) Lr? 9 n 
where, depending on the drive system used, a=am OF aw 
and r=r,, or fw. From Eq. (22) it follows that for a 
Newtonian fluid a plot of logioh vs ¢ should be linear, 
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Fic. 2. Typical plots of logioh vs time for Newtonian liquids 
(white oils). 


and the viscosity should be obtainable from the slope 
of this line and the instrumental parameters. 

Figure 2 shows two typical plots of logy vs ¢ ob- 
tained with Newtonian fluids. Table I, in turn, lists 
data observed on the flow of a number of liquids of 
known viscosity in four different capillaries with both 


_ mercury and water manometers. In all instances slopes 


were determined by least squares. According to Eq. (22) 
the product —nm should be a constant B for each 
capillary, and this is actually the case. 

The constants B can also be obtained from the instru- 
mental dimensions. For this purpose capillary radii 
were determined by partially filling the bore with mer- 
cury, measuring the length of the mercury thread with a 
cathetometer, and then weighing the mercury. The radii 
of the manometers and bulbs were found by introducing 
weighed quantities of mercury into these, and meas- 
uring the differences in levels produced. Finally, the 
lengths of the capillaries were obtained by filling them 
with mercury, and measuring the electrical resistance 
with a Mueller bridge. In these operations corrections 
were made for the resistances of the leads. 

The constants B deduced from instrumental dimen- 
sions are given in the sixth and ninth columns of 
Table I. The agreement between these and —ym is 
excellent, being in all instances but one better than 0.5 
percent. Consequently, within the indicated accuracy 
the instrument may be operated as an absolute vis- 
cometer. 
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TABLE I. Flow data for Newtonian liquids at 30°C. 








Mercury drive 








Water drive 





L ” Bexp X105 % Bexp X105 % 
Capillary cm X10 cm poises (—nm) B calc X105 Deviation (—nm) B calc X105 Deviation 
0.007982 4.096" 4.371 4.354 +0.39 
A 1.233 14.59 0.02607 4.090 
0.1164 4.090 
av 4.092 4.112 —0.49 
0.1164 26.02 27.45 27.52 —0.25 
B 1.948 14.47 0.2145 25.91 
0.4213 25.96 
av 25.96 25.98 —0.08 
0.7249 157.2 167.3 166.7 +0.36 
c 3.033 14.14 0.9269 157.4 
av 157.3 157.8 —0.32 
0.9269 394.5 
D 3.818 14.09 1.0086 394.2 419.4 416.6 +0.67 
av 394.4 394.1 


+0.08 








* Kinetic energy correction applied. For application of this correction see Appendix. 


FLOW OF NON-NEWTONIAN FLUIDS 
IN VISCOMETER 


In Fig. 3 is shown a plot of logio# vs ¢ obtained with 
a 62.2 percent solids synthetic latex at 30°C. The non- 
linearity of the graph indicates that the latex is non- 
Newtonian. The problem involved, then, is the com- 
putation of the rate of shear-shearing stress relation of 
the fluid from these data. Two procedures may be em- 
ployed for this purpose. The first, or incremental one,’ 
involves definition of a mean rate of flow Q as 


hy— hy 





Q=—rr (23) 


b—th 
and a logarithmic mean pressure head 


ho— hy 





h= (24) 
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Fic. 3. Plot of logioh vs time for a synthetic rubber latex at 30°C. 


3G. Barr, A Monograph of Viscometry (Oxford University Press, 
New York, 1931). 


as the effective head for Newtonian liquids. This 
mean value of / is usable for non-Newtonian fluids if 
(ht2—h)/h\<1. In terms of these the apparent fluidity 
¢, becomes 


8LQ 
—-— (25) 
rRah 
and the maximum shearing stress F’,, 
RP Reh 
as ee (26) 
2L «2L 


Now, it was shown by Krieger and Maron‘ that the 
relation between F,, and the rate of shear g(Fm) is 
given by 


(Fm) 1 ding, 

_ 64 + ~ 
Fr, 4 dlnF,, 
Consequently, construction of a plot of logioa vs logioh 
and its differentiation at various values of h yields 
d logiwa/d logioh, which is identical with d In¢./d InF n. 
With the values of this quantity determined, g(F,,) can 
be calculated for various values of F,, to obtain the 
flow curve. 


In the second, or differential method, of treating the 
data F,, is 








(27) 





Rah 
Fh sateen (28) 
2L 
and ¢, from Eq. (22) is 
8 (2.303) Lr’?m m 
a= — =——-: (29) 


R'a B 
‘I. M. Krieger and S. H. Maron, J. Appl. Phys. 25, 72 (1954). 
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TABLE IT. Determination of rate of shear-shearing stress relation for 62.2 percent solids synthetic latex 
by differential method at 30°C. 





























on Capillary D: R =3.818 X107-2 cm, L =14.088 cm 
— Mercury manometer: rm =0.1674 cm, Rc =1.268 cm 
9 dm 
h t dt & (Fm) Fm g(Fm 
cm Sec —m X10° 105 A® da Fn dynes/cm? sec! 
94.25 0 4.605 1.799 0.0921 1.168 1.276 1723 2199 
5 89.25 5.4 4.455 1.759 0.0962 1.130 1.239 1632 2022 
84.25 10.8 4.400 1.734 0.0972 1.116 1.224 1541 1886 
79.25 17.0 4.295 1.695 0.0998 1.090 1.199 1449 1737 
74.25 23.8 4.178 1.684 0.1048 1.060 1.171 1358 1590 
69.25 31.2 4.065 1.653 0.1086 1.031 1.143 1266 1447 
6 64.25 39.2 3.925 1.589 0.1120 0.996 1.108 1175 1302 
59.25 48.8 3.762 1.522 0.1167 0.954 1.065 1083 1153 
54.25 58.2 3.638 1.464 0.1201 0.923 1.034 992.0 1026 
49.25 70.8 3.440 1.389 0.1275 0.873 0.984 900.5 886.1 
44.25 85.2 3.248 1.322 0.1362 0.824 0.936 809.1 757.3 
7 39.25 101.4 3.022 1.246 0.1481 0.767 0.881 717.7 632.3 
34.25 121.6 2.788 1.134 0.1585 0.707 0.819 626.3 512.9 
es: 29.25 147.0 2.562 1.016 0.1680 0.650 0.759 534.8 405.9 
a 24.25 181.0 2.268 0.881 0.1861 0.575 0.682 443.4 302.4 
20.25 218.2 1.940 0.751 0.2168 0.492 0.599 370.3 221.8 
16.25 272.2 1.600 0.582 0.2409 0.406 0.504 297.1 149.7 
This 14.25 310.8 1.425 0.480 0.2566 0.361 0.454 260.6 118.3 
ie if 12.25 363.4 1.170 0.365 0.2896 0.297 0.383 224.0 85.8 
s 1 10.25 436.0 0.970 0.264 0.3035 0.246 0.321 187.4 ; 60.2 
dity 8.25 550.0 0.722 0.189 0.3904 0.183 0.254 150.9 38.3 
6.25 705.0 0.418 0.101 0.6245 0.106 0.172 114.3 19.7 
* awl g logs, 
(25) a= 4 dlogFm 
Substitution of these into Eq. (27) yields CONCLUSIONS 
r dm The results of this paper show that the viscometer 
F — described is capable of giving excellent results when 
(26) ( m)_ es * ae 1 @ used with either Newtonian or non-Newtonian fluids. 
Fn BL 4(2.303) m? When used with Newtonian fluids, absolute viscosities 
(30) may be measured with high accuracy without requiring 
_ the ml 1 dm a density determination. With non-Newtonian fluids 
) is — B 1 9.212m? di the flow curve may be determined over a wide shearing- 





stress range from the observations made during a single. 
In this method logio/ is plotted vs ¢ and differentiated run. Furthermore, by providing means for using lighter 
to give m. The m values are then plotted vs ¢ and the 











(27) plot again differentiated to give dm/dt. These deriva- 2000}- e 
tives are substituted into Eqs. (29) and (30) to find ¢a ° 
sgh and g(F,)/Fm. The steps in this procedure are illus- an i . 
jelds | tated by the data given in Table II, which were de- mY : 
nF,,. | ‘duced from the measurements shown in Fig. 3. 
) can Finally, Fig. 4 shows the flow curve of the latex as S seooh- — 
. the | ‘deduced from measurements in two capillaries of differ- . ° 
ent radii. The fact that the two sets of data lie on a _ e 
x the single curve confirms the theory of the instrument as 5 : 
applied to non-Newtonian fluids, and indicates that a i ° 
excellent results can be obtained with it. © col - 
Of the two methods discussed for obtaining the flow 4 
(28) } curves of non-Newtonian fluids, the second or differ- & oxo}- : ® 
ential method is exact, while the incremental one is oe 
approximate. When the deviation of a fluid from New- ae ° 
tonian behavior is not too pronounced, the two methods an aa 
give identical results, and the incremental one is to be °° 
(29) | preferred since it involves only one graphical differen- ol wt i —_—). a ats a a 





tiation. However, when the fluid deviates appreciably Fa— SHEAR STRESS IN DYNES /CM* 
1954). from Newtonian flow, the differential method may be Fic. 4. Flow curve for 62.2 percent solids latex in 
required to arrive at the relation between g(F,,) and Fn. two capillaries at 30°C. 
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liquids as well as mercury as drive fluids, the viscometer 
offers the means of investigating the low as well as the 
higher shearing-stress ranges, and to ascertain thereby 
more precisely whether yield points exist in the systems 
investigated. 
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APPENDIX 


The only measurement made here which required a 
kinetic energy correction was that of water run with 
the mercury manometer. This correction was made as 
follows. Under conditions where the kinetic energy is 
significant, the pressure drop across the capillary is 








KRIEGER, 





AND SISKO 


where p, is the density of the liquid measured and 1 isa 
constant which may be taken as 1.15. Substituting 
P=ah and Q= —2rdh/dt, we get 











8Lr'n dh np.r* dh 
eee [1 “| 
R* dt 8Ln dl 
which on integration yields 
nap,R*h 
(togis+ ) =— Bi+C (32) 
64(2.303)L*n? 


where B is given by Eq. (22) and C is a constant of 
integration. By estimating » from the low values of h, 
and using it along with the density, the quantity on 
the left of Eq. (32) can be evaluated and plotted vs ¢. 


‘Least squares calculation of the slope of the resulting 


straight line allows then the evaluation of 7. 





given by 
8LOn np.” 
P= + (31) 
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Physical Theory of New Circuit Representation for Junction Transistors 


Jakos ZAWELS* 
Radio Corporation of America, Victor Division, Camden, New Jersey 
(Received November 9, 1953) 


A circuit representation for a junction transistor is derived which places the mechanism of transistor 
action in evidence. The circuit is a direct interpretation of the diffusion equation and the boundary con- 
ditions which include the effects of base width modulation. It is found that the active part of the circuit 
is independent of frequency. A modification of the circuit for common emitter operation is shown and 


experimental results are given. 


I. INTRODUCTION 


HE physical properties of junction transistors have 
largely been described by Shockley,' Sparks,’ 
and Teal.? The diffusion equation is the basic equation 
from which Shockley e/ al. derived some of the expres- 
sions for the electrical parameters. Early* extended this 
theory by stressing the importance of space charge layer 
widening at the collector junction and thereby accounted 
for the “collector resistance” of the 7-equivalent cir- 
cuit. However, it is possible to define the boundary 
conditions for the diffusion equation in a manner which 
will simultaneously account for all transistor parameters 
which have their basis in the diffusion process. 

In either case, whether the Early effect is included or 
not, the use of the diffusion equation and the physical 
shape of a junction transistor implies a unique circuit 

* Present address: Michaelandré House—Apt. 103, 133 Celliers 
Street, Pretoria, South Africa. 

1 W. Shockley, Bell System Tech. J. 28, 435 (1949). 


? Shockley, Sparks, and Teal, Phys. Rev. 83, 151 (1951). 
3J. M. Early, Proc. Inst. Radio Engrs. 40, 1401 (1952). 


in which each element may be associated with each 
individual phenomenon present. Obviously, this circuit 
could be transformed by various circuit manipulation 
into any number of circuits of arbitrary topology, such 
as a T, m, lattice, etc. However, such a transformed 
circuit suffers from two main disadvantages. Firstly, 
the elements lose their individual meaning (for example 
it is difficult to associate the role of the current gener- 
ator in the 7-equivalent circuit with an individual 
mechanism in the transistor). Secondly, a transformed 
circuit inevitably must employ gross approximations in 
order to limit its complexity. 

In this work an equivalent circuit is derived which 
is a direct interpretation of the diffusion equation and 
the boundary conditions. 


II. THE DIFFUSION EQUATION AND THE 
BOUNDARY CONDITIONS 


Figure 1 shows a cross section of a plane parallel 
p-n-p junction transistor having a uniform base thick- 
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NEW CIRCUIT REPRESENTATION FOR JUNCTION TRANSISTORS 


ness W and connected to batteries Vg and Vc. The 
emitter and collector space charge regions are also 
shown. An X, Y coordinate system is indicated with its 
origin at the emitter junction. The behavior for holes 
injected into the base is described by [reference 1, 
Eq. (4.7) ], 








Pp Pa—p ap 
ot sit caiins 


D; ; 
Ox" T. al 


(1a) 


where D, is the diffusion constant, p is hole density, 
tp their lifetime, and p, is the equilibrium value of 
holes in the # region. 

(1) can be split into a de and an ac equation by assum- 
ing a solution: 


p(x,t) = Po(x)+ pi(x) exp(jwl) 


which yields a de equation 











dP Pe- Pu 
- (1b) 
Ox" L; 
and an ac equation, 
Pp pi 
=— (1+ jwr,). (1c) 
af LL? 


The resemblance of the above equation to a trans- 
mission line differential equation is evident. However, 
the exact transmission line analogy will be established 
later. 

The dc boundary conditions for Po, that is the hole 
flow across the junction at x=0 and x+=JW, depend on 
the potential Vg and V¢ across the junction. 

Thus! at «=0, 

Po(0) = pre*”® (2a) 
and at x=W 
Po(W)= pre (2b) 
where 
A=@/kT. 


Using these boundary conditions, the solution to (1b) is 
W 


a 
(e-4Ve om ese) ? 


n sinh | ——— 
W 4 L, 
2 sinh (—) 
Ly 


(e-4 Vo+ es ; 2) 





Py(x)= 


[se 


x—— 
2 
pn cosh |-——-|+ pn. (3) 
W Le 
2 cosh(—) 
Ly 
The boundary conditions for the solution of (1c) are 


given by a perturbation on the dec solution (3) as a 
result of a change in Vg or Vc. 
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Fic. 1. Cross section of parallel plane junction transistor. 


Thus, the ac boundary conditions are at «=0 


OP» 
pi (0) =——+,= p, Ae‘ £2, (4a) 
OVe 
and at x=W, 
OP» 
pi(W)=—», (4b) 
OVe 


where v,=small signal emitter voltage, v.=small signal 
collector voltage. 

In evaluating p:(W) we note that W is a function of 
Vc due to the Early effect. Furthermore, at x=W, 
P» [in Eq. (3)] virtually vanishes especially for col- 
lector voltage Vc> 4 volt. Hence, the boundary con- 
dition (4b) is rewritten 

dP (x) 
p.i(W)=——_ 


Ox |\zmW Ve OVe 


From (3) and (4c) 


OW ss AP, (W) 


-_— 17 ———. 





v. (4c) 








nhesVe 
p.(W)=———+, (4d) 
where 
1 i ow W 
—=| -—— cosech— 
K LAL, dVc L, 


eAvcy 1 OW W e-4¥e 
-( : ) —— coth—+-— | (5) 
eAVE SAL, Vc L, os 











The first term in (5) is much larger than the other two 
terms which involve e~“"¢ for Vc>0.5 volt. Stating 
Eq. (5) in words: K is the ratio of the boundary 
values of the incremental hole densities, (p:(0) and 
pi(W)), at the emitter and collector junctions, respec- 
tively. 

Using boundary conditions (4a) and (4d) the solution 
to the ac equation (1c) may be found. Furthermore, it 
follows from the definition of the diffusion constant 
that the current due to the concentration gradient of 
holes is 

Op 
t»= —@D—- 
Ox 
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Fic. 2. (a) Block diagram of intrinsic transistor. (b) Equiva- 
lent circuit of intrinsic transistor. (c) Equivalent circuit of junc- 
tion transistor. 


Hence the currents due to holes at the emitter (x=0) 
may be written 


, 





QP mbt W 
i,’= an a v, coth—(1+ jwr,)! 
“p Ly 
Ve W 
—— cosech—(1+ jwr,)'} (6a) 
K | = 
and at the collector («=W) 
QP nt W 
i’ = “eAVE(1+4 jor) v. cosech—(1+ jwr,)! 
L, | 
Ve W 
—— coth—(1+ jwr,)! (6b) 
K L 


“p 


The ac electron currents which flow in a p-n-p tran- 
sistor are virtually unaffected by the base width modu- 
lation of the collector voltage. It may be written by 
inspection with the help of (6a) and (6b) as follows: 





Qu phn os - 1 
if'= zone £(1+ jwr,)'v, (7a) 
and 
QNp Mn 
j= = eAVC(14 jarrn’)M0¢. (7b) 


n 


The primed quantities ,’ and 1,’ refer to the collector. 

The total emitter and collector currents flowing now 
are the sums of (6a) and (7a) and also (7a) and (7b), 
respectively. 
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Ill. THE INTRINSIC TRANSISTOR 


The node equations for a four terminal network are 
written generally. 


i= M1 V1 + V1202 (8a) 
io=) V21 Vit Vo 2. (8b) 


(Currents flowing into positive terminals are considered 
positive.) 

Identifying subscripts 1 and 2 with ¢ and c, and com- 
paring coefficients with the total currents flowing, it is 
seen that 














V11= GO cothd+G¢@ (9a) 
Vo1= — G6 coseché (9b) 
v12= —— cosechd (9c) 
G#é eA Ve 
=— cothd+ G¢’—_,, (9d) 
K eAVE 
where ; 
q Giz 
G=—p.mye'™! (. =) (10a) 
W kT 
W . 
=—(1+ jwr,)} (10b) 
Ly 
D,n,W 
o= )} (10c) 
Dyprln 
D,.n,'W 
¢'= (1+ jwrn’)!. (10d) 
Dypnbn' 


K, as defined in Eq. (5), is a frequency independent 
number and may be in the order of 10 000. 

Consider Eqs. (9a) to (9d). For the moment, neglect 
the terms containing ¢ and ¢’. Now it is seen that 
yoo/K=yx, and y32/K= ya. This is interpreted to mean 
that there is present a symmetrical passive .network 
(where yi1=Ye22 and yi2=ye1) to which an impedance 
multiplier K is connected as shown in Fig. 2a. On closer 
examination it is found that this impedance multiplier 
which we will call a ““K amplifier” has the properties: 


output voltage 





input voltage 
and 
output current 





: = Unity. 
input current 


In fact the properties of a K amplifier are identical to 
those of a grounded grid (vacuum tube) triode whose 
plate resistance is zero and whose amplification factor 
p=K—1=K. 

Also, the passive network is an RC transmission line 
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NEW CIRCUIT 


having a characteristic impedance, 





z\?i 1 1 
Zo= (-) =—- (11a) 
y Gé W 
G—(1+ jwr,)! 
Ly 
and a hyperbolic angle 
W 
(zy) ¥=0=—(1+ jwr,)}, (11b) 
Ly 
where 
z=r+ jwl=series impedance per unit length, (11c) 


y=g+ jwC=shunt admittance per unit length, (11d) 
and 


1=length of line. (11e) 


Substituting (11c) and (11d) into (11a) and (11b), 
it follows that the transmission line has zero series 
inductance while its total series resistance rl=1/G, the 
total shunt conductance gl=GIV?/L,” and the total 
shunt capacitance C1=GW?r,/L,’. 

So far, terms involving ¢ and ¢’ in Eqs. (9a) and (9d) 
have been neglected. This is easily accounted for by 
two impedances placed in parallel at each end of the 
circuit as seen in Fig. 2b. Here 


1 D,n,W 
—=G6=G 
Zon Dyprln 





(1+ jwr,)}. 


This impedance Zp, is, for a good transistor, very large 
compared to the characteristic impedance of the line Zo. 
Also, 
1 gaan 
/ 


, 
Z'on 








eAVE 


and is usually entirely negligible compared to the other 
conductances which it shunts, especially for collector 
voltages greater than 0.5 volt. Figure 2b shall be re- 
ferred to as the intrinsic transistor. 

It is seen that the transmission line represents the 
passage of the minority carriers through the base. The 
K amplifier represents the falling of the minority car- 
riers down the potential hill, thereby resulting in volt- 
age, but no current, multiplication. Zo, (and also Zo,’) 
represents the impedance presented to the flow of 
majority carriers from the base to the emitter (and also 
to the collector). 


IV. PRACTICAL TRANSISTOR 


In order to obtain the circuit representation for a 
practical transistor additional elements must be added 
to the intrinsic transistor as shown in Fig. 2c. The 
most important of these are the collector barrier, or 
transition capacitance' Cyc, and the base spreading 
resistance 5, (which is due to the bulk resistivity of the 
germanium). A less important effect for a good tran- 
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Fic. 3. Three-terminal device LMN represented by two circuits 
which are approximately equivalent if yi1.>>yi2 and yo>>ye2. 


sistor is r,, which is the collector leakage resistance due 
to the extraneous matter shunting the collector junc- 
tion. Also, for present-day transistors, Cre, the emitter 
barrier capacitance, is very small compared to the 
capacitance of the transmission line. Furthermore, 
should surface recombination exist (on the surface of 
the base region), as is often the case with alloy tran- 
sistors,! an admittance Y sr should be added probably 
in parallel with Zon. 

Finally, possible effects like the modulation of the 
base spreading resistance rj, by virtue of collector 
barrier width modulation,’ and ohmic series resistance 
in the emitter and collector due to poor lead contact, 
or the material spreading resistance, have been omitted 
either because they are normally negligible or because 
they are experimentally difficult to distinguish from the 
other elements that are present. 


V. GROUNDED EMITTER CIRCUIT 


For the case where the emitter is the common lead 
(in some amplifier applications) difficulty may be en- 
countered if Fig. 2c is used. Thus, we shall modify this 
circuit slightly by the use of the following circuit the- 
orem without losing the physical meaning in the re- 
sultant circuit. 

Consider a network pgr as seen in Figs. 3a and 3b. 
Let a phase reversing transformer and current generator 
iw ne connected to pgr as seen in Fig. 3b. Now, the 
electrical characteristics as viewed from terminals L, 
M, and N are virtually identical for the two devices 
shown in Figs. 3a and 3b, providing that the network 
pr has the following properties 


yu>>yie” and Yu>>YV22 (12) 


where p and gq are regarded as the input leads, while 
r and g are the output leads. 

It now follows that a device may be represented by 
two circuits which differ only in the respect that one of 
them employs a phase reversing transformer and a 
current generator, whose current is proportional to the 
current in the output lead, providing that the inequality 
(12) holds. Applying this theorem: Fig. 2a represents 
a grounded base circuit, while Fig. 4a represents a 
grounded emitter circuit; both employ identically the 
same passive network and K amplifier. 

It is obvious that the sequence of the placing of the 
passive network, reversing transformer, and K ampli- 


4A. R. Moore and J. I. Pankove (private communication). 
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Fic. 4. (a) Common emitter block diagram. (b) Equivalent 
circuit for common emitter connection. (c) Alternate represen- 
tation of circuit shown in Fig. 4b. 
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Fic. 4d. Pictorial association. 


fier is immaterial provided that the impedance trans- 
forming properties of the amplifier are recognized. This 
is a result of these three sections being simply in 
cascade. 

Since the inequality (12) holds for both the intrinsic 
transistor (Fig. 2b) as well as the practical transistor 
(Fig. 2c), there is a freedom of choice as to where the 
external elements of the practical transistor should now 
be connected. Thus, Cc which is connected in parallel 
with the amplifier, between point C and B’ in Fig. 2c, 
may be connected either in parallel with the K ampli- 
fier as shown in Fig. 4b or between point C and B’ as 
shown in Fig. 4c, depending on whether the circuit 
theorem is applied to Fig. 2c or Fig. 2b, respectively. 
Similarly, there are two choices for the placing of 5», 
one of which is shown. 


The sequence of the elements may also be rearranged 
as shown on the left side of Fig. 4d. Now, the flow of 
holes and electrons may be traced in both the pictorial 
representation of the transistor and the equivalent cir- 
cuit, and to a large extent a one-to-one correspondence 
can be established between each element on the left and 
a particular phenomenon to be found in the actual 
transistor on the right. 


VI. EXPERIMENTAL 


In order to determine the elements of a four terminal 
device, four independent measurements as a function 
of frequency are required. Those shown in Fig. 5 are 
for the common emitter connection of an alloy junction 
transistor. 

The magnitude of the input impedance and its phase 
angle for a short circuited output is shown in Fig. 5a. 
The magnitude of the short-circuit current amplifica- 
tion factor and its phase angle is shown in Fig. 5b. The 
equivalent parallel output resistance and capacitance 
for an ac open-circuit input is shown in Fig. 5c. The 
magnitude of the reverse voltage amplification factor, 
and its phase angle, for an ac open circuit at the input 
is shown in Fig. 5d. The small circles are experimental 
points while the solid curves are calculated® after the 
hyperbolic functions of Eqs. 9(a) to 9(d) are approxi- 
mated,® relying on the fact that for a useful transistor 
W?/L,°<1. 

For this transistor it was found that at the higher 
frequencies a small capacitance C; of 4.4 micromicro- 
farad existed between the collector and the base, prob- 
ably due to socket capacitance as well as due to the 
distributed nature of the base material in the vicinity 
of the collector transition capacitance. In Fig. 5d the 
solid line includes the effect of C; in the calculated curve 
while the dotted line excludes it. 


From Figs. 5a to 5d it is found that the first-order 


elements of Fig. 2a (or Fig. 4b) are: r,,=310 ohm, 
Crc=49 mmf; r,= 24 megohm; K = 8500; rl=15 ohm; 
1/gl=255 ohm, Cl=0.042 microfarad, where r, g, and 
C are the resistance, conductance and capacitance per 
unit length and / is the length of the transmission line. 


VII. CONCLUSION 


A circuit representation of a junction transistor has 
been derived which places in evidence the mechanism 
of transistor action. Thus for example, it is seen that 
the passage of minority carriers through the base and 
their drop through the collector potential barrier with- 
out multiplication can intuitively be associated with 


5 The curves are calculated by assuming that the calculated 
and experimental value at 1 kc coincided. Hence, the curves aim 
primarily to show the correctness of the elements used in the 
equivalent circuit and their relative values. This assumption (at 
1 kc) is justified by the closeness of the magnitude of the physical 
constants, such as base width which this implies, to their esti- 
mated values. 

6 The degree of approximation is well within the experimental 
error. 
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the flow of current through the transmission line and 
the K amplifier, respectively. 

Furthermore, it is found that the active part of the 
circuit, the K amplifier, is independent of frequency, 
and a variation of the circuit, which has the emitter 
as a common element employs a current generator 
which is also independent of frequency. 

It is also noted that even in the absence of any 
modulation of the base width by the collector voltage, 
the circuit is unaltered except for the number K. 

It is easily seen that the separation of the active and 
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Fic. 5a. Input impedance 1/ypp. 
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Fic. 5b. Short-circuit current amplification ac». 
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Fic. 5c. Open-circuit output impedance cc. 
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Fic. 5d. Reverse voltage amplification je. 


passive part and their cascaded layout simplifies the 
employment of the circuit in the calculation of gain, 
impedance levels, etc. Obviously, the transmission line 
may be approximated by lumped elements. In fact, the 
complexity of the circuit can easily be altered to meet 
the degree of accuracy required for a particular calcu- 
lation. In its complete form the circuit represents the 
diffusion equation and the boundary conditions without 
approximation. 
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Experimental Procedures for Determining the Efficiency of Four-Terminal Networks* 
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(Received November 6, 1953) 


Five experimental procedures which may be used for determining the efficiency of four-terminal networks, 
by means of circular loci plotted in the impedance or reflection coefficient planes, are presented. 





INTRODUCTION 


IMITED discussions of this subject have been 

written by Barlow and Cullen,' Beatty,? King,’ 

and Macalpine.t However, none of these references 

present methods for determining the efficiency of four- 

terminal networks which are satisfactory for all types of 
networks. 

The procedures which are presented here were de- 
veloped from well-known four-terminal circuit theory. 
No simplifying assumptions were made. Either an 
impedance bridge or a slotted line can be used for ob- 
taining data. Graphical methods are used for handling 
the data. The theoretical basis for these procedures is 
given in the Appendix. 

The arbitrary four-terminal network shown in Fig. 1 
will be considered. Phasors will be denoted by bold face 
type, e.g. E. The magnitudes will be denoted by either 
E or |E}. 


PROCEDURE I 


This procedure can be used when an impedance 
bridge is available and it is convenient to replace the 
load impedance Zz by at least three different values of 
reactance. It is not necessary to know the values of 
the reactances. 

The load impedance Zz, is connected to the four- 
terminal network and the input impedance Z, is meas- 
ured. The load impedance is replaced by a reactance 
and the input impedance Z, is measured. This is con- 
tinued until at least three different values of Z, are 
obtained. 

The normalized values of Z4 and Z, are plotted on a 








z; Le ole 
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Fic. 1. A four-terminal network with external connections. 


* This paper was written while the author was employed on a 
research project sponsored by the U. S. Air Force under contract 
No. AF33 (038)-10405, Supplementary Agreement 56 (53-606). 

'H. M. Barlow and A. L. Cullen, Micro-Wave Measurements 
(Constable and Company, London, 1950), pp. 216-223. 

Os Ri W. Beatty, Proc. Inst. Radio Engrs., 38, No. 8, 893-7 

950). 

*D. D. King, Measurements at Centimeter Wavelength (D. Van 
Nostrand Company, Inc., New York, 1952), pp. 99-105. 

*W. W. Macalpine, Elec. Eng. 72, No. 10, 868 (1953). 


Smith chart,® or a Z—8@ chart, as shown in Fig. 2. These 
impedances can be normalized with respect to any con- 
venient value of Ro. A circle K is drawn through the 
values of Z,. A straight line is drawn through Z,4 and 
the center of the circle K. The efficiency 7 is given by 
the formula 
2D,D2 
— (1) 





7= b J 
(Dit D2) (1—D;?) 






CIRCLE K 





SMITH CHART 


Fic. 2. Diagram for Procedure I. 


where D,, D2, and D; are measured as fractions of the 
radius of the chart. 


PROCEDURE II 


This procedure can be used when a slotted line is 
available and it is convenient to replace the load im- 
pedance Z, by at least three different values of reac- 
tance. It is not necessary to know the values of the 
reactances. 

The load impedance Z, is connected to the four- 
terminal network, the VSWR in the slotted line feeding 
the network is measured, and the position of a voltage 
minimum is located. The load impedance is replaced 
by a reactance, the VSWR is measured, the position of a 
voltage minimum is located, and the distance / from 
the voltage minimum for Z; to the voltage minimum 
for the reactance is measured. This procedure is con- 
tinued until at least three different reactances are con- 


5 P, H. Smith, Electronics 12, 29 (1939); 17, 130 (1944). 
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nected to the network. It is necessary to assume a posi- 
tive direction for /. The wavelength X, in the slotted line 
is either measured or calculated. 

The values of K; and K, given by the formulas 


VSWR-1 
L=—————_- |0° (2) 
VSWR+1 
and 
VSWR-1 
K,=————_ | (l/,) 72° (3) 
VSWR+1 ° &°}32@#&~3 


are plotted on polar graph paper as shown in Fig. 3. 
A circle K is drawn through the values of K,. A straight 
line is drawn through K, and the center of the circle K. 
The efficiency is given by Eq. (1). 


PROCEDURE III 


This procedure can be used when an impedance 
bridge is available and it is convenient to replace the 











270° 


Fic. 3. Diagram for Procedure IT. 


load impedance Z, by a short circuit and an open circuit. 
First the load impedance Z, is connected to the bridge 
and its power factor cos@ is determined. 

Next Zz, is connected to the output terminals of the 
four-terminal network and the input impedance Z, is 
measured. Then Z, is replaced by a short circuit and the 
input impedance Z,, to the network is measured. 
Finally, the short circuit is replaced by an open circuit 
and the input impedance Z., is measured. 

The normalized values of Z4, Z,-, and Z,. are plotted 
on a Smith chart, or a Z—@6 chart, as show in Fig. 4. 
The efficiency is given by the formula 

2d,dz cosé 
ee oe (4) 
d3(1 = d,?) 


where dj, ds, d3, and ds are measured as fractions of the 
radius of the chart. 








SMITH CHART 
Fic. 4. Diagram for Procedure ITT. 


PROCEDURE IV 


This procedure can be used when a slotted line is 
available and it is convenient to replace the load im- 
pedance Z, by a short circuit and an open circuit. First 
the power factor cos@ of Zz, is determined. 

If Zz is connected to the slotted line, the value of Zz 
is given by the equation 


Z.—Ry VSWR-1 
Zi+R,o VSWR+1 





|180°+(s/d,)720°, (5) 





where Roy is the characteristic impedance of the slotted 
line, s is the distance from Z, to a voltage minimum on 
the slotted, and X, is the wavelength in the slotted line. 
It can be shown that 








1—A 
cos# = : (6) 
V¥(1—A)?+44 sin?(s/A,)720° 
where 
VSWR-1\? 
Pe - 
VSWR+1 


When Z, is replaced by a short circuit, the voltage 
minima are located at points which are m\,/2 from the 
short circuit, where ” is an integer. When Z, is replaced 
by an open circuit, the voltage minima are (w+ 4)A,/2 
from the open circuit. Consequently, Eq. (6) is also 
valid if s is defined as the distance from a voltage mini- 
mum when Z, is connected to the slotted line to a 
voltage minimum when either a short or an open circuit 
replaces Z,. 

The load impedance Z, is connected to the slotted 
line, the VSWR is measured, and the position of a 
voltage minimum is located. Then Z, is replaced by 
either a short circuit or an open circuit and a voltage 
minimum is located. The distance s between the two 
voltage minima is measured and the value of cos@ is 
calculated using Eq. (6). 
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The load impedance Z, is connected to the four- 
terminal network, the VSWR is the slotted line feeding 
the network is measured, and the position of a voltage 
minimum is located. The load impedance is replaced by 
a short circuit, the VSWR is measured, the position of a 
voltage minimum is located, and the distance / from the 
voltage minimum for Z,, to the voltage minimum for the 
circuit is measured. This procedure is repeated for an 
open circuit. It is necessary to assume a positive direc- 
tion for /. 

The values of K,, K,., and K.., given by Eqs. (2) 
and (3), are plotted on polar graph paper as shown in 
Fig. 5. The efficiency is given by Eq. (4). 

When cosé<1, the losses in the connections between 
the slotted line and Z,; may not be negligible. If 6 
denotes the efficiency of power transfer through the 
connections 


VSWR. s= (1+6Ko)/(1—5Ko), (8) 
where 


K,= (Z,—Ro), ‘(Zi +Ro). (9) 


If Z, is replaced by either a short circuit or an open 
circuit, 


VSWR;= (1+4)/ (1-8). (10) 
The corrected VSWR is 
VSWR;VSWR.w.;-—1 
VSWReorrectea = ——__ (11) 


VSWR;—VSWRos 


The corrected value of VSWR should be used in Eq. (7). 

Also when cos#<1, VSWR»; and VSWR; will be very 
large and special methods must be used to determine 
their values. Either of the methods described by Terman 
and Pettit® should be satisfactory. 


PROCEDURE V 


For some networks, it may be found that the above 
procedures give a value for the efficiency of approxi- 


90° 








270° 
Fic. 5. Diagram for Procedure IV. 


‘Ff. E. Terman and J. M. Pettit, Electronic Measurements 
(McGraw-Hill Book Company, Inc., New York, 1952), p. 140. 
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Fic. 6. Connections for Procedure V. 


mately 0/0. In such a case, it may be desirable to use an 
auxiliary network. The auxiliary network is connected 
as shown in Fig. 6. 

The overall efficiency 2 can be determined by any 
of the above procedures. Then the efficiency m of the 
auxiliary network can be determined by any of the 
procedures. The efficiency 72 of the four-terminal net- 
work is simply 


(12) 


It may be observed that Procedures I-IV are much 
simpler when Z 4 = Ro, i.e., when K;,=0. For this reason, 
it is desirable that the auxiliary network be a matching 
network. It is essential that the auxiliary network have 
a high efficiency. 


H2= 712; M1- 


APPENDIX: GENERAL THEORY 


It is easily seen that, for the arbitrary four-terminal 
network shown in Fig. 1, 


I,=E,/(Z:+Z,), (13) 
Vi=1,Z4=E,Z4/(Z:4+Z:), (14) 

and the power P; delivered to the network is 
P,\=1PR4=EFRa/|\Z:4+Za\’, (15) 


where R, is the resistive component of Z.. 
According to well-known four-terminal circuit theory, 


V:=AV.— BL, (16) 
I,=CV.—DI,, (17) 
V.=DV,-—Bl,, (18) 
I,=CV,—Al,, (19) 
with 
AD—BC=1. (20) 


It follows from Eqs. (13), (14), and (19) that 
I,=E,(CZ4—A)/(Z:4+-Za). (21) 
The power P2 delivered to the load impedance Z, is 
P,=12R,=E?R1|CZ4—A|?/|Zi+Za|*?, (22) 


where R, is the resistive component of Z,. The effi- 
ciency 7 of power transfer is 


n= P2/P\=|CZ4—A\?Ri/Ra. (23) 


It should be noted that the efficiency is independent 
of Zi. 

Now it will be assumed that the network is fed by a 
lossless transmission line and the signal generator is 
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matched to the transmission line. The signal source and 
the lossless transmission line feeding the network can 
be represented by a simple generator having a generated 
voltage E, and an internal impedance Ro. This assump- 
tion does not affect the efficiency of the network because 
it can be concluded from Eq. (23) that the efficiency is 
independent of how the network is fed. 
It follows from Eqs. (16)—(19) that 


Z4=(AZ_+B)/(CZ:+D) (24) 
and 
Zp=(DRo+B)/(CRo+A). (25) 
When Eq. (24) is solved for Zz, the result is 
Zi= — (DZ4—B)/(CZ4—A). (26) 


If Z, is replaced by an arbitrary reactance X, then 
the input impedance Z, to the network is 


Z.= (jAX+B)/(jCX+D) 
and the voltage reflection coefficient K, is 
K,= (Z,—Ro)/(Z:+ Ro) 
DR,—B+ j(CRo—A)X 
~ DRo+B+ j(CR+A)X 


(27) 





(28) 


Equation (28) can be written 
gn tine 2a-RoDRet Bt j(CRot+A)X 
ZstRo Zs+RoDRo+B+ j(CRo+A)X 
Z4+Ro DRo—B+ j(CRo—A)X 
~ Z4+RoDRo+B+ j(CRo+A)X 
Zi—Ryo  2RfDZ,4—B+ j(CZ4—A)X] 
“ZatRo (Zat+Ro)(DRo+B+ j(CR+A)X] 


When the values of (DR o+ 8B) and (DZ4—B) given by 
Eqs. (25) and (26), respectively, are substituted in 
Eq. (29), the result is 











(29) 














Za—Ro 2Ryo CZs—AZi—jX (30) 
=——_—+- ————, (3 
Zat+Ro RotZa CRot+AZpt+ jX 
Since 
Rot+Za 
Zi+Ze=— : (31) 
(CZ4—A)(CR,+A) 
RotZa 
CR,+A=— (32) 
(Zr+Zzs)(CZ4—A) 
and 
Za—Ry 2Ro(Zi1+Zs)(CZa—A)? Z~— jX 
K,= = . GS) 
ZatRo (Ro+Za)? Zet jx 
For convenience in notation, let 
K,= (Za—Ro)/(Za+Ro) (34) 
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and 
xn hort Ze) (CZ,—A)? ie (35) 
(Rot+Za)? Zat+jx 
If one lets 
jX = (1+k)/(1—b), (36) 


where k is an arbitrary complex number such that 
k=1, then 


2Ro(Zr+Zpe)(CZ4—A)? 
(Ro+Za)? 


K.,=K,— 





Z.~1~Gc+ ih 
Zet1—(Zs—1)k 





(37) 


According to a well-known conformal mapping theorem, 
K, will lie on a circle K when plotted on the complex 
plane. When the normalized values of Z4 and Z, are 
plotted on a Smith chart, or a Z—8 chart, this is equiva- 
lent to plotting K, and K,, respectively, on the complex 
plane. Thus as X varies, the locus of the normalized 
value of Z, on a Smith chart, or a Z—8@ chart, will be a 
circle K. The circle K must enclose the normalized value 
of Z4. If Zs and Zz, are conjugate impedances, the 
center of the circle K and the point corresponding to 
Z. will coincide. 

Let X; and X»2 denote the values of X for which 
K,’ is a maximum and a minimum, respectively. Let 
K,’, K»’, Ki, Ke, Zz, and Zz. denote the values of 
K.’, K., and Z, corresponding to X,; and Xe, respec- 
tively. After K, and K,’ are determined by the usual 
analytical method for finding the maxima and minima 
of a function, it can be shown that 


Ki'K,! — 2RpRzt|CZ4—A!? 





= (38) 
|\K,’—K,’| |Ro+Za|? 
It can also be shown that 
| Rot+Za|?=4RoRa/(1—-K7? . (39) 
Consequently, 
Ki'K,’ 
———=R,(1—Ky,”)|CZ4—A|?/2R,. (40) 
|K,’—K,’| 


The derivation of Eq. (40) is not given in detail because 
of its length. If Eq. (40) is compared with Eq. (23), 
it is seen that 


n=2Ki'K2'/(1—K_1”)|K,’—Ky’|. (41) 
If the values of K,’ are plotted on the complex plane, 
their locus is a circle. It is easily seen that K,’ and K,’ 
can be found by drawing a straight line through the 
origin and the center of the circle. The points of inter- 
section of this line and the circle are K,’ and K,’. Also, 


if K, is plotted on the complex plane, the locus is the 
circle K. The values of K; and K, can be found by 
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drawing a straight line through K, and the center of 
the circle. The points of intersection of the line and the 
circle are K, and Ko». 

If K,.’ and K,, denote the values of K,’ and K,, 
respectively, when X =0, then 


K,.’=K,.—K1 
2Ro(Zi+Zp)(CZ4—A)* Zi 
=— ey. ~ ae 
(Ro+Za)? Zp 


If Ky.’ and Ko. denote the values of K,’ and K,, re- 
spectively, when X= o, then 











Ky.’ =Ko-—Kz 
2Ro(Zi+Zs)(CZ4—A)’ 
= . (43) 
(Ro+Za)? 
It is easily seen that 
K,.-Koe 2RoZ_(CZ4—A)’ 
= (44) 


w’—Ko! (Ro +- Za)? 





MATHIS 


When Eq. (44) is compared with Eqs. (39) and (23), it 
is seen that 
2K.’ Ko-’ cos 


IK,.’—Ko.! | (1 — K 1”) 








2|K,.-—Kz_| |Koe—Kz|cosd 
=——____________, (45) 
K,.—Koe| (1—K 1" 





where 


cosd= R1,/Z 1. (46) 


Equations (41) and (45) can be used for determining 
the efficiency of four-terminal networks. When these 
equations are used, it is not necessary to know the 
absolute values of the angles for the complex reflection 
coefficients. It is sufficient to know their relative values. 
Also if it is more convenient to deal with impedances 
than with reflection coefficients, the impedances can be 
plotted on a Smith chart, or a Z—8@ chart, and the mag- 
nitudes of the reflection coefficients used in Eqs. (41) 
and (45) can be found by making simple measurements 
on the chart. 
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in the Electron Microscope 
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The chromatic field aberration of the magnetic lens in the electron microscope is theoretically studied. 
Good results are obtained for the coefficient of the chromatic difference in magnification of the objective, 
taking into account the inclination of the principal illuminating rays. The same coefficient of the inter- 
mediate lens increases rapidly as the magnification is reduced by the intermediate lens. The calculated 


results agree well with experimental results. 


Calculated results are given for the coefficients of chromatic difference in rotation of all lenses as well as 
for the coefficient of chromatic difference in magnification of the projection lens. 


INTRODUCTION 


N the electron microscope chromatic aberration, 

astigmatism, and spherical aberration are the most 
important defects of the image. The difference in color 
in light optics corresponds to the difference in electron 
speed in electron optics, and the inhomogeneity of the 
electron speed is caused mainly by the fluctuation of 
accelerating voltage and the energy loss of the electron 
in passing through a layer of matter. Since the chro- 
matic aberration in the electron microscope has not 
been sufficiently compensated, great efforts have been 
made to stabilize the accelerating voltage. Fluctuation 
of the exciting current of a magnetic lens has effects 
similar to those of accelerating voltage, and the exciting 
current has also been kept extremely constant. 

In our laboratory chromatic aberration has been 
studied for several years, and S. Katagiri has already 


reported his experimental results.' In this paper, the 
calculated results for chromatic field aberration of the 
magnetic lens will be described. The abstract has al- 
ready been presented at the eighth lecture meeting of 
the Society of Electron Microscopy (Japan) held in 
Nagoya University on September 16, 1952. 


CHROMATIC FIELD ABERRATION 


The chromatic field aberration consisting of chromatic 
difference in magnification and chromatic difference in 
rotation causes the deviation of each image point from 
the Gaussian image point. The deviation is proportional 
to the relative variation of kinetic energy Ad/¢ and the 
distance of the object point from the axis ro. The radial 
component 67 and the tangential component 6,, of the 
displacement referred to the object plane are repre- 


1S. Katagiri, J. Electronmicroscopy 1, 17 (1953). 
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CHROMATIC FIELD ABERRA 


sented as follows: 


Ag 

5 rm=Crm'to-—, (1) 
rn) 
Ag 

6r-=C rr, + 9° —. (2) 
vo) 


Here Cr, and Cr, are constants of proportionality 
known as the coefficient of chromatic difference in mag- 
nification and that of chromatic difference in rotation, 
respectively. 

A variation in the exciting current Ai has the same 
effect as a change in kinetic energy of the electron 
eAd if 

Ad Ai 
ee. (3) 
r70) 1 


CHROMATIC DIFFERENCE IN MAGNIFICATION 
(1) For the Objective 


If the specimen is illuminated by electron rays parallel 
to the axis, chromatic difference in magnification is pro- 
portional to the variation of the focal length of the 
lens Afo/ fo due to an energy difference of the elec- 


trons Ad/¢. 
Orm Ad Afo 
(“) = Cent = ——* (4) 
ro Fo ? fo 


Here, of course, it is assumed that the image distance 
is very large compared with the focal length. For the 
weak lens, the focal length is nearly proportional to ¢ 
and the coefficient is nearly equal to —1. However, as 
the lens strength increases, the value deviates from —1. 

Assume that the axial field distribution of the lens is 


H(0) 
H(z)=———_; (5) 


#(!) 


the focal length of the objective is given by 


Wo i (6) 
with 


e 
wo” = 1+——H?(0)a,?. 
8m J 





Here e/m is the specific charge of electron and the 
suffix 0 represents quantities concerned with the ob- 
jective. In the same manner, suffixes p and i will repre- 
sent the quantities concerned with the projection lens 
and the intermediate lens, respectively. From Eqs. (4) 
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Fic. 1. Coefficient of chromatic difference in magnification 
for the objective (I). 


and (6), the coefficient is given by 





T 
7 (wo?— 1) cos— 
Wo 
(Crm)o= . (7) 
Tv 
2wo* sin— 
Wo 


According to Eq. (5), Eq. (6), and the relation 


drni= f H (z)dz, (8) 


the variable w can be translated to the variable ni/./@ 
as follows. 


nt amp. turn 
—= 16381(6t—1)| | (9) 
Vo (volt)! 


K. Kanaya® already reported Eq. (7) and W. Glaser® 
introduced a different formula: 


, , Tv (wo? —1 ) 
(Crn)oo= ————. (10) 

us 

2wo* sin— 

Wo 


Figure 1 shows the relation between (Crm)o and wo, 
and Fig. 2 the relation between (Crm)o and (ni/./@)o. 
The experimental results show that the coefficient 
changes its sign at the value of (mi/\/@)o about 19 
amp. turn/(volt)!, while Glaser’s coefficient (Crm)o® is 


~ 2K. Kanaya, Report of the Cooperative Research Committee 
on Electronmicroscopy of Japan, No. 56-A-7 (1951) (in Japa- 
nese). 

3 W. Glaser, Z. Physik. 117, 285 (1941). 
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Fic. 2. Coefficient of chromatic difference in magnification 
for the objective (IT). 


always negative and (Cr»,)o' changes its sign at a very 
large value of (ni/\/o)o. Thus, the results calculated 
from these formulas do not coincide with experimental 
results. 

The disparity seems to be mainly caused by the 
assumption of parallel illumination. V. K. Zworykin 
and his co-workers‘ suggested that the inclination of 
the principal illuminating rays should be considered, 
but they scarcely described this quantitatively. 

The author assumes that the inclination of a prin- 
cipal ray is equal to the inclination of a ray, at the 
object plane, which is parallel to the axis at a point 
sufficiently far from the lens. 

If it is assumed that the deviation corresponding to 
A@ of the plane conjugate to the image screen is A/fo, 
it is possible to write 


, (Crm) = (1+ Ko): (Crm)o', 
with 
T T 
Wo SIN— COSwoer-+ Sinwer COS— (11) 
Wo Wo 
Ko= ——, 


sinwor ) 





Here (Crm)o! is given by Eq. (7). The results of Eqs. 
(11) approach the experimental results as seen in Fig. 2. 

It is more reasonable to use (AZ;)o—the deviation 
of the focal point corresponding to A¢—instead of A/o. 
In this case, the coefficient is given by 


(Cra)gitt= (1-K. sec” ) Crnot. (12) 


wo 


The results from Eq. (12) fairly well coincide with the 


4Zworykin, Morton, Ramberg, Hillier, and Vance, Electron 
Optics and the Electron Microscope (John Wiley & Sons, Inc., 
New York, 1945). 
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experimental results as seen in Fig. 2, where Eq. (9) is 
used for the translation of w to ni//¢. The field distri- 
bution of the lens varies with the ratio, h/d, of the 
pole-piece spacing / to the pole-piece diameter d, and 
it is hence expected that the curve of the coefficient in 
Fig. 2 varies with //d of the lens. This does not appear 
in the calculated results because of the assumption that 
the field distribution is given by Eq. (5). However, it 
is believed that a variation with //d is practically out 
of the question, since lenses with a comparatively small 
value of /:/d are usually used in the electron microscope. 


(2) For the Projection Lens 


If it is assumed that the chromatic difference in mag- 
nification depends only upon the variation of the focal 
length, the coefficient is expressed by 


wy —1 
- —+(1—w,m cotw,7). (13) 
2w,” 


(Crm) - = 


Figures 3 and 4 show the relations of (Crm),' with wp 
and with (ni/./@)», respectively. Contrary to the case 
of the objective, the calculated coefficient (Crm)! 
agrees well with the observed values as seen in Fig. 4. 

When the focal length of the projection lens is long, 
it is inadequate to consider the magnification as in- 
versely proportional to the focal length. The correction 
is given by 


1 
A(Crm) a= OV. (Crm) mS (14) 


p 


Here M, is the magnification of the projection lens and 
its sign is chosen as positive for the erect image and 
negative for the inverted image. M, is negative in 
general. 

The principal planes of the projection lens separate 
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Fic. 3. Coefficient of chromatic difference in magnification 
for the projection lens (I). 
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when the excitation is strong. A correction for the sepa- 
ration of principal planes is given by 


a 
And Eo —, 
‘ | 
with 
; (w,’— 1) (1+-cosw,7) (sinw pr— wT) 
K,/= ’ 


2w» Sin*w pr 





Here /,, represents the distance of the imaging screen 
from the projection lens. This correction need not be 
taken into consideration in practice, because @p/Ip. is 
generally small and K,’ becomes large only when the 
excitation of the lens is stronger than for maximum 
magnification. 

Correction for the inclination of the principal rays is 
considered similarly as for the objective. For the elec- 
tron microscope with the lens system which comprises 
an objective and a projection lens, the correction is 
given approximately by 








a 7 
, P 
A(Cr m) p— K ee eae 
W pd p 
— 
' tanw,7 - (16) 
with 
. . 
_,, @r—1)(sinw yr coswyr—w pm) 
—- ~ ’ 
2w, sin? 
2w» Sin*w pm J 


and for the electron microscope with three lenses, 
namely an objective, an intermediate lens, and a pro- 
jection lens, it becomes approximately 





(Moi) m Lo; 
ap 


Moi lop 
A(Crm) p= K - $$$ —__—_____, (17) 


W pp 
lio— 





tanw »7 


lop and /;, are the distance of the projection lens from 
the objective and from the intermediate lens, respec- 
tively, Jo; is the distance of the intermediate lens from 
the objective, Mo; is the magnification by the objective 
and the intermediate lens, and (Mo,)m is the value of 
Mo; when the intermediate lens is not used. The value 
of w,a,/tanw,r is usually small compared with /o, and 
l;». In general, this correction is small and need not be 
considered. 

For the projection lens, contrary to the objective, 
lenses with comparatively large h/d—the ratio of the 
pole-piece spacing and diameter—are available, as well 
as lenses with small #/d. Though Eq. (9) is utilized for 
the translation of w, to (ni/\/@),» in this study, it is 
not expected that the calculated results will agree with 
the experimental results for all lenses with various 
values of 4/d. The fact that the value of ni/\/¢ at 
which the coefficient is equal to zero becomes small as 
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Fic. 4. Coefficient of chromatic difference in magnification 
for the projection lens (IT). 


the value of //d becomes large has been proved by 
Katagiri’s experiments in our laboratory. A similar 
tendency was found by the author in the relation be- 
tween ni/\/o at minimum focal length and h/d. M. v. 
Ments and J. B. Le Poole’ reported their fine work on 
the same subject. The result of the calculation, how- 
ever, did not quantitatively coincide with that of Kata- 
giri’s experiment. 


(3) For the Intermediate Lens 


The intermediate lens can be used not only similarly 
as the projection lens, but it can be used to reduce the 
magnification. It will be described here only for the 
latter case, since in the former case the intermediate 
lens is regarded as the first projection lens. 

As the first approximation, it is assumed that the 
principal rays are parallel to the axis after passing 
through the intermediate lens. Under this assumption, 
the coefficient can be written as follows. 


7 t A i 
(Crm)it _ oe > Hah Oo (18) 
vn; Ad fi fi Ab 


In general, the intermediate lens is used at a small 
value of ni/,/¢ and the variation of the focal length is 
nearly equal to that of kinetic energy of the electron: 








~ = (19) 
fi o 
Consequently, 
(Crm)i= o=(14 ~) = i— | (20) 


5M. v. Ments and J. B. Le Poole, Appl. Sci. Research B1, 3 
(1947). 
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is obtained, where 2; is the distance of the image plane 
from the lens. 

Considering the inclination of the principal rays after 
passing through the intermediate lens, the corrected 
coefficient is given by 


1—R) 
(Cra) t'= . : 
R | 
with (21) 
Mo, 
Stes | 
(Moi)m 2 


The coefficient is represented as a function of R and 
takes a simple and convenient form. R depends not 
only on the focal length of the intermediate lens, but 
on its position, so the coefficient cannot be written as 
a function of ni/\/¢ alone. If the lens is weak and it 
is permissible to write 


1 ni \? 
—=c(—) | (22) 
fi Vol ; 


the coefficient can be written jn the form 








ni \? ) 
(=) 
os i 
(Cre) ft =—_—————_ 
ni \? 
1-1(—) ‘ (23) 
’ Vo7 i 
with 
Crlgi-v; C-loi-lip 
k= = : 
loi; lop / 


Figure 5 shows the variation of the coefficient with R 
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Fic. 5. Coefficient of chromatic difference in magnification 
for the intermediate lens (I). 
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Fic. 6. Coefficient of chromatic difference in magnification 
for the intermediate lens (IT). 


and Fig. 6 the variation of the coefficient with (ni/\/), 
for a specific instrument. 
For a strong lens, Eq. (19) is replaced by 


Afi Ad 
fi ? 
and 
(Crm) dU = p(Crm) (25) 


is obtained, where p is a function of w,; and if it is 
assumed that the field distribution of the lens is given 
by Eq. (5), it takes the form 


w?- 1 





p= _ (1—w,yr cotw,r). (26) 


Wj 
CHROMATIC DIFFERENCE IN ROTATION 
(1) For the Objective 


Image rotation due to the objective, when the field 
is assumed to have a form given by Eq. (5), is repre- 
sented by 


Z 2b 
Yo= (wo?—1)!- (cot-“—cot-), (27) 


ao ao 


where z, and z are the z coordinates of the specimen 
plane and the correctly focused image plane, respec- 
tively. Differentiating by ¢, under the assumption that 
the term inside the brackets is constant, leads to 


T (wo? — 1)} 
(Cr-)o= —-———,, (28) 
2wo 
whereas Glaser* reported 
tt (wo? — 1)} 
(Cr)o@= -———. (29) 
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Fic. 7. Coefficient of chromatic difference in rotation (I). 


In his calculation, the term inside the brackets is con- 
sidered as a function of wo or ¢. Figure 7 shows the 
relation between (C r,)o and wo and between (Cr,), and 
wy. In Fig. 8, the abscissa is translated to ni/\/@ using 
Eq. (9). 

Equation (28) is more suitable, in practice, than 
Glaser’s formula (29), as is easily seen in the figure. 


(2) For the Intermediate Lens and the 
Projection Lens 


Chromatic difference in rotation of the intermediate 
lens is quite similar to that of the projection lens and, 
therefore, it will be described here for the latter alone. 

Image rotation of the projection lens is given by 


Ww2—1)) (= (=) 30) 
ree Pa 2 /o a , 


Differentiating by ¢, as in the case of the objective, 
leads to 


m(w,’—1)! mw fle, (ni 
(Cr) p=" —=-"(=) (=) . 61) 
2 2\m Vo » 


Its behavior is shown in the above-mentioned Fig. 7 
and Fig. 8. The calculated coefficient coincides approxi- 
mately with the observed values. The coefficient (Cr,) p 
is linearly proportional to (ni//¢), and scarcely de- 
pends on the shape and dimension of the pole pieces. 


(3) Sign of the Coefficient of the Chromatic 
Difference in Rotations 


The coefficient of the chromatic difference in rotation 
calculated from Eq. (28) or (31) is always negative. 
A negative coefficient means that the tangential devia- 
tion of the image point due to an increase of the accel- 





erating voltage is in the opposite direction to that of 
image rotation. Inversion of the excitation polarity 
causes a change of the direction of image rotation and, 
consequently, a change of the direction of the tangential 
deviation. It may be possible to include the sign in the 
representation of the coefficient, but no confusion will 
result in practice if the aforementioned facts are borne 
in mind. 


COMPOUNDED ABERRATION OF THE LENS SYSTEM 


The compounded chromatic field aberration of a lens 
system, which comprises an objective, an intermediate 
lens, and a projection lens, can be represented by use 
of the aberration coefficient of each lens. 

For chromatic difference in magnification 


Ad 
brm= { (Crm)ot+ (Crm)it (Crm) p} elt”, (32) 
and for chromatic difference in rotation 
Ag 
5rr= { (Crr)o (Crr)it (Crr) p} _ (33) 


In Eq. (33), 57, is positive when the direction of the 
tangential deviation of the image point is the same as 
the direction of the image rotation due to the objective. 
Positive and negative signs are chosen when the excita- 
tion polarity of the intermediate lens or the projection 
lens is the same and opposite to that of the objective, 
respectively. Of course, it is assumed that the optical 
axis of each lens is perfectly aligned. 

Chromatic field aberration compounded of the chro- 
matic difference in magnification and the chromatic 
difference in rotation is given by 
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Fic. 8. Coefficient of chromatic difference in rotation (II). 








With a knowledge of these relations and the behavior 
of each coefficient, problems concerning chromatic field 
aberration and its compensation in a lens system can 
be easily discussed. 


DISCUSSION 


Zworykin and his co-workers® describe the chromatic 
aberration as follows. 





Awy= (Ci +-1C2)w,+Cswe, (35) 
eAd fr” 

C=-M — f IPr ar dz, (36) 
8m¢?¥ 2, 
Ad e \'? r* 

c--u-—.(—) f Haz, (37) 
26 8m zg 

eAd *b 

C;=—-M-- [reas (38) 

Smr apd’ zg 


They deal with combined electric and magnetic fields, 
whereas only a pure magnetic field is covered here. 
Aw, is the deviation of the image point from the 
Gaussian image point, w, corresponds to the position 
of the object point and ws, corresponds to the position 
at which the ray passes the aperture plane, 2=2z. 
r, and r, are the particular solutions of the paraxial ray 
equation 


eH? 


#+—1r=0 (39) 
8m 


and satisfy the boundary conditions 


Taga (zy) =0, fag=Ta (z,)=1, (40) 
and 
Tyg=Ty(Z)=1, rye=ry(2z8)=0, (41) 
respectively. 
Substituting Eq. (5), 
2g  ) 
cot '—=-, 
a@ @ 
2b 
cot-'—=0, (42) 
a 
and 
2p OT 
cot~-—=-— 
a@ 23 





6 See reference 4, p. 570, 
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in Eqs. (36), (37), and (38) leads to 











wT 
1 (w*— 1) cos— 
MAd 2 
C\=—- ‘— —, (43) 
ro) wr 
2w” sin— 
2 
M Ad Tt (w"— 1)! 
Co=— ° , (44) 
rH) 2w 
MAd~ t(w’—1) 
3=— ° (45) 
79) Tr wr 
2w* sin— sin— 
(69) 


The first term in Eq. (35) depends only on the posi- 
tion of the object point and vanishes when the object 
point is situated on the axis, and the second term de- 
pends on the position at which the ray passes the aper- 
ture plane, namely on the ray path. C; is ascribed to 
chromatic difference of magnification, C. to chromatic 
difference in magnetic rotation, and C; to chromatic 
difference in image position. 

These results are convenient in the case of dark field 
images in which the full free aperture is comparatively 
uniformly illuminated by the scattered electrons and 
the formation of image depends on the scattered elec- 
trons. In this case, the second term of Eq. (35) repre- 
sents the disk of chromatic confusion and the first term 
represents the deviation of the center of the confusion 
disk from the Gaussian image point. 

In many cases of the bright field image, however, 
scattered electrons form a very narrow pencil and 
illuminate only a part of the aperture. Electrons, which 
pass near by the object matter, also contribute to the 
formation of the outline of the image. In these cases, 
chromatic image defects observed in practice are differ- 
ent from those of the above-mentioned case. Namely, 
Arm given by Eq. (46) is observed as the chromatic 
difference in magnification. 


Arm=Cyw,t+C3wp*. (46) 


Here w * is the wz of the principal ray. 

If the principal ray is assumed to be a ray which is 
parallel to the axis at the point sufficiently far from the 
lens, 


ad sin(we—wr) 
r*= (47) 
sing 
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cotg=-, 
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Arm is given by 


MA@o@~ z(w’*—1) coswr 











Arn= aaa "Tn" Sagres 
ri) 2w? sinwa 

( —_———. 
w COSwr sin—-+ sinw7 Cos— | 
MAd w w | 
- ‘41— L 

79) T 

sinwmr cos— 

(>) 


vs 
1 (w*— 1) cos— 


w 
x ‘ (48) 
T 
2w* sin— 
w 





This result is quite the same as Eq. (12). If the aper- 
ture plane is chosen appropriately at the rear focus, 
Eq. (48) is derived directly from Eq. (36), but it must 
be considered that the position of the aperture plane 
varies with reference to the lens strength. 

Chromatic aberration can also be solved by using a 
particular solution, 7s, instead of r,, which satisfies the 
boundary conditions’ 


Tp (z5) =1, ig (Zy) =0. (49) 


Chromatic aberration is then given, corresponding to 
Eqs. (35), (43), (44), and (45), by 








Aw,= (Cy'+ iC 2')wot C3'ti, (50) 
T 
1 (w’— 1) cos— 
M Ao w 
C,'= —— . (51) 
v0) T 
2w* sin— 
w 
MAd r(w—1)! 
C./=— . =Cs, (52) 
re) 2w 
MAd ar(w?—1) 
C= —-—__-___—__, (53) 
T 
2u* sin so 
w 


where w, represents the inclination of the ray at the 
object point. Here, Ay,, may be determined by 


Arm=Cyw,tC3'w,*. (54) 


Equation (48) can also be derived from Eq. (54), 
assuming the ray given by Eq. (47), since 


qT T 
w,*= -a(« coswmr sin—-+ sinwm cos”). (55) 
o) Ww 


It may be convenient, for the case of the bright field 


7K. Kanaya, Bull. Electrotech. Lab. (Tokyo) 15, 86 (1951) 
(in Japanese). 
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image, to write the chromatic aberration as follows: 
Aw,= ci'+cl— aug Cy (Ww —w,*). (56) 
Wg 


The first term of Eq. (56) corresponds to the chromatic 
field aberration and the second term to the axial chro- 
matic aberration, where w,—w,* is the inclination of 
the ray with respect to the principal ray. 

The coefficient of axial chromatic aberration intro- 
duced by W. Glaser* corresponds to C;’. His coefficient 
of chromatic difference in magnification has a different 


physical meaning because of the disregard of the fixed 
image plane. 


CONCLUSION 


For the chromatic difference in magnification of the 
objective, it is clear that the inclination of the principal 
illuminating ray should be taken into consideration. 
Though the assumed field—Eq. (5)—is not identical 
with practically realized distributions, it seems that the 
error of the results is not important. For the case in 
which the holes of the upper and lower pole pieces differ 
in diameter, it is necessary to calculate with an assumed 
field different from Eq. (5). The assumptions of the 
calculation for inclined principal rays appear to be 
reasonable for present-day electron microscopes. 

For the projection lens, it is sufficient to consider 
that the chromatic difference in magnification is pro- 
portional to the variation of the focal length. In gen- 
eral, the correction mentioned in this paper need not 
be taken into consideration. The variation of the co- 
efficient with the pole-piece diameter and spacing must 
now be inferred from experimental studies; it is to be 
hoped that a simple and practical method of calcu- 
lating this variation will be found. 

For the intermediate lens, the coefficient of the chro- 
matic difference in magnification is always positive and 
increases rapidly from zero as the reduction ratio of 
magnification by the intermediate lens decreases from 
unity. In the three-stage electron microscope, complete 
correction of the chromatic difference in magnification 
cannot be attained unless the reduction ratio of magni- 
fication is greater than 3. The permissible range of the 
ratio is more restricted when the objective and the pro- 
jection lens are strong. 

Granting that Glaser’s coefficient of chromatic differ- 
ence in rotation of the objective has some physical 
meaning, it cannot be used in practice. Experimental 
values smaller than those calculated may result from the 
magnetic potential drop in the iron yoke of the lens. 
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The statistics of disordered phenomena as exemplified by Einstein’s theory of the Brownian motion is 


applied to the flow of fluids through porous media. 


It is shown that such a statistical treatment of the hydrodynamics in porous media automatically explains 
some well-known phenomena in a more satisfactory manner than do capillaric models. The statistical theory 
leads to a differential equation of motion of the fluid which is a modification of that of Darcy; notably a new 
macroscopic quantity is introduced which is termed “‘dispersivity.” This quantity is indicative of the 
sideways dispersion which a stream of fluid undergoes when it is passing through the porous medium. Under 
certain statistical assumptions outlined in the paper, the dispersivity becomes a constant of the porous 


medium. 


The new differential equation of motion of the fluid is discussed in detail and some indications about 


applications are given. 


I. INTRODUCTION 


HE flow of fluids through porous matter is usually 
described by empirical laws of which the so-called 
law of Darcy in the form given by Muskat' is one of the 
best known. However, such empirical laws do not 
enable one to understand properly what is taking place 
in the porous medium, unless they can be founded upon 
more fundamental facts about flow through small 
openings. 

It was realized long ago that Darcy’s law must ulti- 
mately be deducible from the laws of viscous flow. 
Nevertheless, attempts of such a deduction have thus 
far been unsatisfactory. These previous attempts were 
mostly based (more or less openly) upon the construc- 
tion of theoretical (capillaric) models: The porous 
medium is represented by a bundle of capillaries; the 
laws of viscous flow are applied to the latter and thus it 
is attempted to deduce relationships between the various 
macroscopically measurable quantities. A survey of a 
number of such theoretical models has been given and 
their implications have been discussed by the author® 
on an earlier occasion. The agreement of the properties 
of the capillaric models with those of natural porous 
material is rather poor; it is unavoidable to introduce 
fudge factors in order to get tolerable results. This, of 
course, is an unsatisfactory procedure and nullifies the 
very purpose of introducing the models. 

A somewhat different approach to the problem was 
chosen by Kozeny.’ In the latter’s treatment, no open 
mention is made of any model. Nevertheless, all his 
considerations refer to cross sections normal to the 
direction of flow and pressure gradient in the porous 
medium—and are thus equivalent to the introduction 
of a model of parallel capillaries. Furthermore, recent 


* Published by permission of Imperial Oil Ltd., Calgary. 

1M. Muskat, The Flow of Homogeneous Fluids Through Porous 
Media (Edwards Brothers, Inc., Ann Arbor, Michigan, 1946). 

2 A. E. Scheidegger, Producers Monthly 17, No. 10, 17 (1953). 

3 J. Kozeny, Ber. Akad. Wiss. (Wien) 136A, 271 (1927). 


experimental investigationst have shown that the rela- 
tionships derived by Kozeny are quite inaccurate. In 
addition, results based on the models discussed above, 
and based especially on the Kozeny treatment, have 
been used to deduce fundamental relationships for the 
flow of fluids in porous media. This is particularly true 
for the attempts to correlate permeability and capillary 
pressure,® and relative permeability and capillary pres- 
sure.® These correlations were subject to experimental 
tests and showed rather large inaccuracies which, in 
turn, render the underlying Kozeny equation again 
more doubtful. 

It therefore seems to the writer that an entirely 
different approach to the problem should be tried. Thus, 
instead of representing a porous medium by the ex- 
tremely ordered device of a bundle of capillaries, it will 
be represented by a model of exactly opposite charac- 
teristics: A model that is assumed to be completely 
disordered. It is of course to be admitted that an 
actual porous medium will perhaps not be exactly 
represented by either type of models, but that it may 
lie somewhere in between the two. However, since the 
assumption of complete disorder appears never to have 
been rigorously investigated, it seems worth while to 
try it. 


Il. THE HYPOTHESIS OF COMPLETE DISORDER 


The hypothesis of complete disorder of porous ma- 
terial will stand foremost in all further considerations. 
Naturally, it will not be mechanics in the ordinary 
sense that can be applied to such a disordered medium, 
but one will have to have recourse to statistical methods. 
The principal methods of the statistics of complete dis- 
order have been initiated by Einstein in his paper 
on Brownian motion,’ and later were expanded by 


4C. S. Brooks and W. R. Purcell, Trans. Am. Inst. Mining 
Met. Engrs. 195, 289 (1952). 
( sd of R. Purcell, Trans. Am. Inst. Mining Met. Engrs. 186, 32 
1949). 
5 L. A. Rapoport and W. J. Leas, Trans. Am. Inst. Mining Met. 
Engrs. 192, 83 (1951). 
7 A. Einstein, Ann. Physik 17, 549 (1905). 
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Langevin’ and Smoluchowski.’ A review of the methods 
is available in several papers.!?"! 

Statistics has been applied before in a very ele- 
mentary way to flow in porous media.” In this instance, 
the process of passage of a fluid particle through a 
porous medium has been regarded as a stochastic 
process. However, this needs some explanation: In one 
particular piece of rock the path of an individual par- 
ticle is most emphatically not the outcome of a sto- 
chastic process, but entirely determined by the pre- 
vailing boundary conditions and the equations of 
motion. In steady-state flow, each subsequent “‘par- 
ticle” will follow exactly the path of the previous one, 
which is contrary to all stochastic assumptions. Thus, 
our fundamental idea is not to consider the statistics of 
flow in the first place, but to consider in lieu of one piece 
of a particular porous material, a great number (called 
‘“ensemble’’) of similar pieces with identical macroscopic 
properties and dimensions. In this way it is possible to 
define “‘averages over the ensemble”’ or ‘“‘probabilities”’ 
for which dynamical equations may be set up. In each 
individual sample, however, the phenomena are gov- 
erned by the ordinary laws of mechanics, i.e., they are 
entirely predetermined and the element of “‘chance”’ is 
excluded. 

The fluid will be considered as a continuous medium, 
of which each point has a flow-path. We shall call each 
geometrical point of the fluid a “particle,” but it should 
be understood that this has nothing to do with the 
actual molecules of the fluid. This is quite in contrast 
with the theory of Brownian motion where the statistics 
of the fluid and not those of the container are con- 
sidered. 

Thus, the fundamental problem in this paper will be 
to find the equations of motion for the probability 
density (over the ensemble) of the position of a particle. 
If one further assumption is made, which (for want of a 
better name) will be called “ergodic hypothesis,” then 
it will be possible to identify the probability density 
with the actual physical density of a “parcel” of fluid 
represented by, say, a drop of dye spreading through 
the porous medium. This hypothesis assumes that the 
geometrical conditions for the motion of a particle pre- 
vailing at a spot in a particular porous medium are 
entirely uncorrelated with those at any other spot of 
that particular piece of material. This is actually an 
exact statement of what is meant by “hypothesis of 
disorder.” Admittedly, no porous medium will behave 
in exactly that manner as the pore channels all have a 
finite length (and thus there is a correlation between 


8 P. Langevin, Compt. rend. 146, 530 (1908). 
9M. v. Smoluchowski, Ann. Physik 21, 756 (1906). 
1 W. Pauli, Stalistische Mechanik (Verein der Mathematiker 
und Physiker at Swiss Federal Inst. Technol., Zurich, 1947). 
11. Infeld, On the Theory of Brownian Motion, University of 
piney Studies, Applied Mathematics Series No. 4, Toronto 
1940). 
2 E. C. Childs and N. Collis-George, Proc. Roy. Soc. (London) 
A201, 392 (1950). 


geometries over such small pore lengths)—but this 
hypothesis is the only ‘“‘backdoor” by which stochastics 
can be introduced into the present considerations. It 
seems to the writer that this hypothesis is at least as 
reasonable as assuming capillaric flow—although it 
corresponds to the opposite limit case. 

Thus, we shall apply the statistical method as out- 
lined before to the flow of fluids through porous media. 
Unless something else is specified, we shall make the 
following assumptions throughout the following sections 
of the paper: 


(a) The porous medium is isotropic and, averaged 
over the ensemble, homogeneous. 

(b) The external forces on the fluid are homogeneous 
and time independent. This may be realized by a 
constant pressure drop along a given direction. For the 
sake of simplicity, the gravity term will be neglected. 
It could easily be incorporated into the theory by intro- 
ducing a more general external potential than the 
pressure alone. 

(c) Different parts of one sample are ‘“macroscopi- 
cally identical.” In other words, if a random selection 
of pieces is taken from only one sample, one obtains a 
series of samples forming an ensemble with which to do 
statistics. This means that a fluid particle proceeding 
through the porous medium finds the same ensemble 
probability for displacement at all points of its path. 
Under the assumptions (a) and (b) this also implies 
that the time average equals the ensemble average of 
the displacement of a particle. This is the assumption 
which has been called above “ergodic hypothesis.” It is 
a rather restrictive hypothesis but it is held that it 
corresponds physically to a reasonable limit case. 


In the final sections of this paper, hypotheses (a) and 
(b) will be relaxed; but this will be mentioned in the 
proper place. Without any further remarks, (a), (b), 
and (c) are understood to be assumed. 


III. GENERAL THEORY 


Consider a “particle” of fluid in the sense defined 
above. The probable positions of the particle at time ¢ 
(where “probable” is meant with reference to the 
“ensemble of macroscopically identical porous media” 
and not with reference to the position of the particle 
in only one sample—the position of the particle in only 
one sample is sharply defined at all times by the Navier- 
Stokes equations) is now to be calculated. 

We split the time 0 to ¢ into V small intervals 7 such 
that 

Nr=t. (3.1) 


In every time interval 7 the particle will undergo a 
displacement ~ (with components &, 7, ¢, with respect 
to a Cartesian coordinate system—vectors are denoted 
by boldface letters). The (ensemble-) probability that a 
given displacement & will take place in a time interval r, 
will be denoted by 0(&). Under the assumptions made, 
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this probability taken over the ensemble will depend 
neither on the position of the particle in the porous 
medium nor on the instant. The probability 2(&) is 
assumed to be normalized so that 


f o(E,n,t)dédndt = 1. (3.2) 


The average (over the ensemble) displacement in the 
time interval 7 will be denoted by (&),. It is given by 


(n= f bo(8) dE = (&9,8). (3.3) 


It is very awkward to deal with probability distri- 
butions whose medians are not zero so that it will be 
convenient to introduce a coordinate system in which 
the median of the probability distribution will be zero. 
We shall indicate this coordinate system by dashes. 
Thus we set 


b’= F—(E)m. (3.4) 


In this dashed system the probability for the fluid 
particle to have a displacement £’ will be 


w(z’). 


The average of &’ is zero. The function w(’) may de- 
pend on (&),, as a parameter. 

The fundamental question now is: What is the proba- 
bility wy(x’,y’,2’) over the ensemble in which a fluid 
particle with initial coordinates (0,0,0) in the dashed 
system will have coordinates x’, y’, 2’ after the time 
t=Nr? 

The problem thus appears as one of the random-walk 
type. It should be noted, however, that the problem is 
physically really very different from a random-walk 
phenomenon. We do nol consider the path of only one 
particle in only one sample (that path is rigorously 
defined by the Navier-Stokes equations)—but rather 
the probable path of particles starting out of corre- 
sponding spots in an ensemble of porous media. As 
emphasized several times before, the statistics apply to 
porous media rather than to fluid flow. 

Because of the assumptions that the porous medium 
is homogeneous and isotropic, and the field of forces 
homogeneous, the original probability density w(é’,n’,¢’) 
will be split into a product of three identical proba- 
bilities, 


w(t’,n’,¢’) =w(t’)w(n')w(¢’), (3.5) 


and because of the ergodic hypothesis, those proba- 
bilities will be identical for every time step. In the light 
of the ergodic hypothesis, there are no limitations to 
the size of r. This is, of course, somewhat of an over- 
simplification for very small 7, since the probability 
distributions w for 27 and w for 7 will be correlated. 
Nevertheless, assumption of the ergodic hypothesis 
greatly simplifies matters from a theoretical and physi- 
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cal standpoint, and describes porous media which are 
completely disordered over the smallest of intervals. It 
is therefore held as a reasonable limit case to allow 7 to 
be arbitrary. Because of the ergodic hypothesis, and 
only because of this hypothesis, the problem of deter- 
mining the probability of the position of the particle 
with respect to the ensemble becomes one of the 
random-walk type. 

Thus, in terms of probability calculus, the problem 
is formally nothing but the composition of repeated 
trials. No matter what the original distribution w is, 
the central limit theorem of the calculus of probability 
states that the distribution after V trials is Gaussian, 
provided .V is large. Thus, if we denote by o the standard 
deviation of w, it follows: 


wy (x’,y’,2”) = (24 No?)-3 


Xexp[ — (x?+ y"+2")/2No")]. (3.6) 


The quantities ¢ and 7 are constants during the 
motion (although o may depend on (£)4) so that one 
may set 

o*/r=2D, (3.7) 


where D is some factor which we shall call “factor of 
dispersion” —for a reason to be explained later. Using 
Nr=t yields, 


w(t,x’,y’,2’) = (4eD1)-3 
Xexp{— (x"+y"+2")/(4D/}, (3.8) 


and returning to undashed coordinates, 


v(t,x,y,2) = (4rDt)—! exp{ —L(x—#)? 
+(y—9)?+(s—2)?)/(4D1}. (3.9) 


This is the fundamental probability distribution de- 
scribing the journey of a fluid parcel through the porous 
medium. Of course, it is based upon all the statistical 
assumptions outlined in the previous sections. 


IV. DARCY’S LAW 


First we must find a connection between the average 
displacement (x)w=(%,g,Z) and the external field of 
forces. Because of the ergodic hypothesis one has 


(x) ay (t= Vr) = VCE) wv. (4.1) 


Here, of course, use was made of the assumption that 
the external force was assumed to be homogeneous and 
time independent throughout the porous medium. Ac- 
cording to these assumptions only, the average dis- 
placement is independent of the time and the location 
of the fluid parcel in the porous medium. 

For each step 7, the laws of viscous flow state 


(1/r)| &| = (B/x)| gradp| cosé, (4.2) 


if we denote the external force (per unit pore volume) 
by gradp and the angle between — and —gradp by 6; 


18 R. v. Mises, Wahrscheinlichkeitsrechnung (F. Deuticke, Wien, 
1931). 
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B is a certain factor indicative of the reciprocal re- 
sistance of the opening through which the fluid (of vis- 
cosity “) moves during the time step r. The particular 
value of B at every point will depend on the shape and 
size of the opening through which the fluid packet 
passes. 

The second problem is to form the average of (4.2) 
over “the ensemble.” The average (£)s will be in the 
direction of —gradp if the porous medium is assumed 
to be isotropic. Its magnitude will be equal to the 
average of the component of & in the direction of 
—gradp. The component of & in the (— grad) direction 
is equal to | £| cos#. Thus, assuming that B and cos@ are 
independent, we have, 


(E)x,/7= — (B/p) grad p(cos?6) ay (4.3) 
(V)w= — (B/u) gradp(cos6) x, (4.4) 


where (V),, is the intrinsic average velocity of the fluid; 
it is equal to (1/P) times the macroscopic filter 
velocity V (P is the porosity). The average value of 
cos?@ (which is a constant of the material) may be 
called the reciprocal of the square of a “tortuosity,” 
since (1/cos@) is only the relative length of the flow 
path per unit distance along the pressure gradient. We 
thus write 


or 


(cos?@) = 1/T?. (4.5) 


From Eq. (4.3) the fundamental probability distri- 
bution becomes in virtue of (4.1) 


v(t,x,y,2) = (44D?)-! exp[— (x+¢(B/u) 
X grad p(cos?6),)?/(4Dt) }. (4.6) 


The crest of this distribution proceeds with the 
velocity (V)« which is the same as the intrinsic average 
velocity of the fluid. The filter velocity V=(V),P is, 
therefore, 


V=—P(B/u) grad p(cos?6)w. (4.7) 
Comparing this with the so-called Darcy’s law,' 
V=— (k/u) gradp, (4.8) 
(k=permeability, u= viscosity) yields 
k= PB(cos?6) n. (4.9) 


The (cos?@),, factor in the expression (4.9) for per- 
meability has been introduced earlier by Sullivan and 
Hertel."* They arrived at it by considering the path of 
a particle chrough winding capillaries of only one sample 
of porous matter, and therefore their average is taken 
over the path of only one particle. Because of the ergodic 
hypothesis, this is the same as our ensemble average; 
the fact that the two methods agree serves as corrobora- 
tion of the present formalism. 

It is seen that a statistical theory gives one a con- 
nection between the macroscopic permeability and 


4 R. R. Sullivan and K. L. Hertel, Advances in Colloid Science. I 
(Interscience Publishers, Inc., New York, 1942), p. 37. 
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porosity, and the average of the microscopic reciprocal 
“resistance.” Thus, one might call B the “intrinsic 
reciprocal resistivity” of the porous medium. Without 
the need to specify B further, the last formula shows 
that Darcy’s law follows from a statistical theory for a 
porous medium, provided that proper assumptions for 
the statistics are made (ergodic hypothesis) and that 
the law of motion through each microscopic channel is 
such that the velocity of flow is proportional to the 
negative local pressure gradient and to the reciprocal 
viscosity of the fluid. Trivial as this statement may 
appear, it does not seem to have been proved before 
under such general conditions. 


V. THE INTRINSIC RECIPROCAL RESISTIVITY 


The quantity B cannot be specified further unless 
more specific assumptions about the porous medium 
are made. 

If it be assumed that the flow channels are circular 
capillaries of diameter 6 with the axis inclined at the 
angle @ towards the negative pressure gradient, the law 
of Hagen-Poiseuille gives for each individual capillary 


B=8/32. (S.1) 
If this be averaged, one obtains 
B=(6),/32. (5.2) 
The expression for the permeability becomes 
k= P(8*),,/(32T?). (5.3) 


This is the expression which is also obtained by 
setting up a model of noninterconnected capillaries 
representing the porous matter. 

One can rid oneself of the condition that the capil- 
laries have to be circular. For a noncircular capillary of 
circumference the quantity B becomes 


B=7°C/(32n°), (5.4) 


where C is a factor indicating the deviation of the cross 
section from a circle (C is 1 fora circle, 0 for a “‘fissure’’). 
Thus one obtains for the reciprocal intrinsic resistivity 


B= (C/32m*)(y?)w. (5.5) 


The average circumference of the channels can be ex- 
pressed by the specific internal surface area S, of the 
medium. If be the number of pores per unit cross- 
sectional area normal to the pressure gradient, one has 


4n/(cos6) y= Sa, (5.6) 
but also 


P=C'n(m/4)(8*) y/{COSO) ay = C’ ny") v/ (4 cosO), (5.7) 


where C’ is another deviation factor indicating the 
deviation of the pore cross section from a circle (C’=1 
for a circle, 0 for a “‘fissure’’). 

Thus, (interchanging (y”) and (y)x”) 


¥=nP/(C’Sa), 


(5.8) 








and 


B= (1/2)(€/C) (P*/S.°). (5.9) 
By comparison with Darcy’s law, 
k= (1/2T?)(€/C”) (P8/S4°). (5.10) 


The last equation is commonly termed ‘‘Kozeny equa- 
tion.” 

The present deduction of the Kozeny equation from 
a theory that is applicable to a disordered porous 
medium shows just how many assumptions one has to 
introduce so as to arrive at it. Not only are “deviation 
factors” introduced, but also averages such as (y*)«, and 
(y)a? are arbitrarily interchanged. The final undeter- 
mined factor C/(27*C”) depends on properties of the 
porous matter and, through a proper microscopic 
arrangement, can have values in an extremely wide 
range. 

One does not even need to take recourse to a statis- 
tical theory of the present type in order to show that 
the Kozeny equation is based on such arbitrary pro- 
cedures as interchanging (y”)« and ()«”; a simple array 
of different-sized capillaries will bear this out.’ The 
present deduction of the same fact from a statistical 
theory corroborates these earlier findings. It is, there- 
fore, not at all surprising that the Kozeny relation 
gives such poor agreement with actual experimental 
tests."* As pointed out before, the Kozeny relation is 
equivalent to a capillaric model and more than a 
qualitative description of the empirical facts cannot 
therefore be expected from it. 


VI. THE FACTOR OF DISPERSION 


In the fundamental probability distribution there 
also appears the factor D. This factor does not enter 
into the laws of average advance of a fluid parcel in 
the porous medium; nevertheless, it has another macro- 
scopic meaning. 

The Gauss distribution at the time /¢ is given by 
Eq. (4.6). It has been shown that the average position 
of the median of this distribution is 


(x(t))w= —B grad pt(cos?6),,/p. (6.1) 


Similarly, the expectation value of the square length 
(x?(¢))4 can be calculated. From the general theory of 
Gauss distributions it follows 


(x?(0))w= 2Di+[(B/n) gradp(cos’0)a!P. (6.2) 


This shows that the “dispersion” of a fluid particle 
within the ensemble is essentially dependent on D. 

In virtue of the ergodic hypothesis, one can formulate 
these results so as to make them applicable to an ex- 
tended fluid parcel within one sample of porous ma- 
terial. The proceeding of a fluid parcel through the 
porous medium could be realized experimentally: if a 
drop of dye were added to the liquid, one could watch 
the concentration of the dyestuff. Each particle of the 
drop of dye would proceed according to the probability 
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distribution as given by Eq. (4.6). The concentration 
of dye at any one time is therefore given by the linear 
superposition of probability distributions of the type of 
(4.6), all starting out from different origins. The Eq. 
(4.6) is a solution of the differential equation, 


d0/dt= A(Dv)+div[ By grad picos’O)x./u], (6.3) 


as can be verified by a straightforward check by differ- 
entiation. The most general solutions of Eq. (6.3) is a 
linear superposition of solutions of the type (4.6), i.e., 
just the type of function which describes the concen- 
tration of the dye in the fluid. The fluid parcel, thus, 
proceeds according to Eq. (6.3). 

In the present theory, since gradp etc. have been 
assumed as constant, the factors of v could be pulled 
through the Laplacian and the divergence, and then 
Eq. (6.3) would look as follows, 


dv/dt= DAv+ (B(cos’6)s/u)(gradp grad). (6.3a) 


The statistical theory of flow through porous matter 
thus yields the result that each drop of liquid gets dis- 
persed while it is passing through the medium. This is 
a fact which should be expected to occur, but which 
has never been described by any of the former theories 
of flow. 

The factor of dispersion D was introduced in a some- 
what formal way, as composed of the (ensemble-) 
standard deviation of the motion of a particle in an 
arbitrary time interval r. It would be desirable to 
have another expression for D in terms of the micro- 
scopic quantities of the motion of each fluid particle. 

This can be accomplished in the following way using 
a procedure established by Langevin. Newton’s equa- 
tion of motion for a particle of the fluid is (force equals 
mass time acceleration), 


B(f—px"*)=x‘n. (6.4) 


This, of course, is a new basic assumption, but Newton’s 
law of motion is generally accepted as valid. In (6.4) f is 
the force per unit volume (apart from the viscous 
forces), p the mass density and yu the viscosity of the 
fluid, B the local reciprocal resistance as defined before, 
and dots in juxtaposition denote time derivatives. 

If one multiplies the last equation by x, one obtains 


Bfx— Bpx"’ x= xx'p. (6.5) 
Now one observes that 
x’ x= (d/dt)(xx’)— x= (1/2) (d?/df)x’— x", (6.6) 
thus that Eq. (6.5) becomes 
Bfx— (1/2) Bp(d?/df)x?+ Bpx’?= (u/2)(d/dt)x*. (6.7) 


The task is now to form the average of this expression 
(6.7) over “the ensemble.”’ The forming of the average 
of x, x’, and x” is no difficulty if it be borne in mind 
that time derivation and averaging can be interchanged. 
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As to the average of fx, the following argument can 
be made. 

First, f and x are independent in an ensemble of 
isotropic and homogeneous porous media. Thus the 
term ((fx)),, will simply be equal to (f)a(x)w. Second, 
the (ensemble) average (f), of f can be calculated using 
the method of Langevin.’ Thus the force f per unit 
volume on a particle at the point x can be split into a 
disordered and an ordered component. The disordered 
component will make the particles deviate from their 
average course and will cancel out on taking the 
average over the ensemble. The ordered force will be 
the same in all samples of the ensemble and will be 
parallel to the direction of —gradp. For a particle 
proceeding at an angle @ with the —gradp direction, 
the force due to the pressure gradient (i.e., the ordered 
component) will be —gradp cos#. The component par- 
allel to the —gradp direction of this is —gradp cos’6. 
If this be averaged over the ensemble, one obtains 


(f)4,= — grad p(cos”6) wy. 
Thus, the average over the ensemble of Eq. (6.7) is 
— B(x), grad p(cos?0) sy 
— (1/2) Bp(d2/d?)( x?) x, 
+ Bo(x"*) = (u/2)(d/dt)( x"). (6.8) 
From earlier results, we may insert (x(¢))4= — (B/n) 
X grad p(cos*@),/ (from Eq. (6.1)) and |x'| =—(B/p) 
X | gradp| cos@ (from Eq. (4.2)) ; thus, the last equation 
becomes 
[(B/u) grad p(cos*6)s, Ft—4(Bp/u) (@/d?)(x*(t))m 
+ (Bp/u)({(B/u) gradp cosé ?)», 
=4(d/dt)(x*(t))m. (6.9) 
This is a differential equation for (x?(¢)),4, whose station- 
ary solution is [assuming (x?(t))4=0 for ‘=0 | 
(32() w= 2(Bp/u)L(C(B/u) gradp cost P) 
—[(B/u) grad p(cos’6)» P je 
+[(B/u) grad p(cos’6), PF. (6.10) 
If this be compared with the earlier expression (6.2) 
for (x?(?)), one obtains 
D= (Bp/u)((((B/n) gradp cos8)*)m 
— ((B/u) gradp (cos’6))?]. (6.11) 
As B and cos@ have been defined in microscopic terms, 
Eq. (6.11) gives the complete connection of the macro- 
scopic factor D with microscopic quantities. 
It is rather awkward to deal with D in an equation 


for 2, as it is a function of the pressure gradient. How- 
ever, the expression for D may be written as follows: 


D= (pa/u*) (gradp)?, (6.12) 

with :: _ 
a= B(((B cos8)*)— (B(cos?6)m)? J. (6.13) 
Here, a is a constant of the porous material, which in 
principle, could be measured macroscopically. Because 


of its connection with the dispersion of a parcel in the 
porous medium, we shall call this constant its “dis- 
persivity.” 

It may be noted that a different interpretation of the 
first term in Eq. (6.11) can also be given. For, 
(1/2)p{x') is some constant, equal to the average 
kinetic energy E per unit pore volume. Therefore, we 
have 


(1/2) p(x?) w= E 
= (1/2)p((B gradp cos8)*)m/u*, (6.14) 
and thus 
[ (B/u) gradp cosé P= 2E/p. (6.15) 


VII. INHOMOGENEOUS FORCES 


The foregoing discussions pertain to a homogeneous 
field of external forces only. However, the case where 
the pressure is varying from place to place in the porous 
medium is of practical interest. 

The transition from a homogeneous field to an in- 
homogeneous one is made along the usual lines: It is 
assumed that in the e-neighborhood of a point, x, y, z 
the forces are “almost” homogeneous. In other words, 
there exists a neighborhood «¢ to each point x, y, 3, 
within which the external forces change arbitrarily 
little. Within such a neighborhood all that has been 
said before applies. 

It is thus clear that all the earlier equations apply 
which involve only the differentials of the displace- 
ments. Thus, the motion of fluid in an inhomogeneous 
field may be summarized as follows: 

Consider a particle of fluid (as defined in Sec. 2) 
whose probability to be at time / at the location 
x, y, , be denoted by v(x,y,2,/). According to the ergodic 
hypothesis this is also the density of an actually larger 
parcel of fluid when it gets spread out through the 
porous matter. 

The differential equation of motion for v is given by 
Eq. (6.3). Because of the variability of gradg, etc., 
the form (6.3) has to be used and not (6.3a). Substi- 
tuting D from Eq. (6.12) and B from Eq. (4.9) yields 


dv/dt=AL[ (pa/u*) (gradp)*v ]+div[o(k/uP) gradp]. (7.1) 


Here, a, k, and P are macroscopic constants of the 
porous medium (dispersivity, permeability and porosity, 
respectively) which may be defined in terms of micro- 
scopic quantities as outlined in the earlier sections of 
this paper. Contrariwise, p and yu are properties of the 
liquid (density and viscosity, respectively). 

From the differential equation of motion it follows 
that a fluid particle in (x,y,z,/) moves with the average 
velocity (V), as can be seen from the solution 4.6: 


(V)w= — (k/uP) gradp, (7.2) 


which corresponds to an average filter velocity V 


(V= (V)wP), 


= —(k/u) gradp. (7.3) 











However, this is no longer equal to the velocity of 
the center of mass of a large parcel of fluid moving 
through the porous medium, nor is it equal to the 
velocity of the average position of a fluid parcel. Those 
two quantities can no longer be calculated simply and 
generally. One has to solve Eq. (7.1) for each particular 
field of force and calculate the required averages. 

Equation (7.1) is not able to give the complete solu- 
tion of the problem of flow through porous matter, as 
it surmises that the pressure gradient be known at 
every point of the medium. There are 3 unknown func- 
tions (viz. v, p and p) and only one equation. Two more 
equations are therefore required to make the problem 
determined. 

One of the two further equations which are required 
is immediately given by the (experimental) relationship 
between the pressure and the density of the fluid, viz. 


p=p(p). (7.4) 


The last equation which is needed, will be furnished by 
the continuity condition. 

Thus, the problem is to formulate the continuity con- 
dition in terms of the probability distribution function 
v(x,/). 

At time /=¢p there will be a certain distribution of 
the fluid in the porous medium. Let this distribution be 
denoted by w(Xo,fo) where the position coordinate is 
now denoted by xo. Naturally, xo has the same range 
of values as x. At time ¢ (arbitrary), the “original” 
distribution of fluid w(xo,fo) will have the probability 
density v(Xx,Xo; ¢,fo) to be at the spot x at time /. Here, 
v(x,X0;/,to) is a “key” solution of the differential Eq. 
(7.1). It satisfies the boundary conditions and is a 
delta function for ‘=t) of the argument x—x». (The 
calculus applicable to delta functions has been discussed 
e.g. by Schwartz.'®) It has therefore properties similar 
to a Green’s function. 

Thus, the distribytion of fluid at time / will be given 
by w(x,t) as follows: 


w(xst)= f o(xx05 tnx) (7.5) 


Now, the physical concept of continuity states that the 
distributions w must be proportional to the densities of 
the fluid times the local porosity and thus we can write 
Eq. (7.5) as follows: 


P(x)p( xf) =f 0(4%)%05 bao (oa) P(x) (7.6) 


This is the required continuity condition. 

The continuity condition in the form written in (7.6) 
is rather awkward to deal with. One would like to have 
it in differential form instead of integral form. 

This aim can be achieved by taking the time deriva- 


16 L,. Schwartz, Ann. télécommunications 3, 135 (1948). 
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tive with respect to / on both sides of Eq. (7.6), one 
thus obtains 


C) ’ dv »Xo; /,l0 
P(x) p(x A) f sn (7.7) 
dt dt 





However, the solution v is subject to the differential 
Eq. (7.1); one can thus insert dv/d¢ from (7.1) into 
(7.7) to obtain 


dp (x,t) pa e 
P(x) rs = f {|S ceaap 





k 
+div| o— grado] p(Xo,lo)P(xo)d*xo.. (7.8) - 


i" 


Upon letting ‘=¢o, one finally obtains 


Pdp/dt= Al (Pp*a/u*) (gradp)*] 
+div[p(k/u) gradp]. (7.9) 


The last step, of course, is based upon juggling around 
with the delta function where relationships such as 


@ grad,6(x— X09) = —@ grad x05 (x— Xo) 
=6(x—Xo) gradzop (7.10) 
have been used. 

In this manner, the problem of flow through porous 
media is solved by the three Eqs. (7.1), (7.4), and (7.9). 

The theory of flow through porous media as it is 
outlined above could be modified in several ways. First, 
it is noted that the pressure is introduced as a given 
“external force’; no pressure fluctuations within “the 
ensemble” are therefore taken into account. This is not 
quite as general as one might desire, and thus a pressure 
probability distribution with corresponding fluctuations 
could be introduced instead of an external pressure. 
Second, the porous medium could be assumed as 
anisotropic. This would necessitate that the perme- 
ability and dispersivity of the porous medium be treated 
as tensors’ instead of as scalars. 

These modifications, however, are fairly straight- 
forward and would add nothing to the actual physical 
ideas of the method of statistics of disorder. Intro- 
ducing them would just drown the physical facts in a 
rather involved and lengthy scheme of mathematics. 


VIII. CONCLUSIONS 


The theory outlined above gives a description of the 
hydrodynamics in porous media. The method of the 
statistics of disorder yields the differential equation for 
the motion of a fluid through a porous medium; if 
certain factors are set equal to zero, the motion is the 
same as described by Darcy’s law. However, if those 
factors are not set equal to zero, then a new macro- 
scopic effect is described of which it is known that it 
must occur: the effect of dispersion. Individual par- 


16 J. Ferrandon, Génie Civil 125, 24 (1948). 
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ticles of the fluid do not only move along the stream- 
lines resulting from Darcy’s law, but they also get dis- 
persed sideways. This is expressed by a new quantity 
denoted by a and termed “‘dispersivity,” which appears 
in the equations of motion. Under certain statistical 
assumptions outlined in this paper, the dispersivity is 
a constant of the porous medium. The fluid flow through 
porous media is therefore, in this instance, determined 
by three macroscopic constants of the latter (porosity, 
permeability, and dispersivity) and not by only two as 
assumed heretofore. 

The introduction of the dispersivity “constant” was 
based upon a variety of assumptions. Some of these 
assumptions were statistical in nature, others mechani- 
cal. Of the statistical assumptions, the ergodic hypothe- 
sis as outlined in Sec. 2 is the most restrictive. Of the 
mechanical ones, the hypothesis that the fluid is a 
continuum and the flow always laminar merit being 
emphasized. Thus, molecular slippage or Brownian 
motion within the fluid are disregarded. 

The treatment of dispersivity as a constant of the 
material is therefore conditional to these assumptions. 


Thus it may be expected that future experimental tests 
will show that the new flow equations (as is also the 
case for Darcy’s law) are valid in certain pressure 
ranges only. Even for restricted pressure ranges, it 
might turn out that the assumptions made, especially 
the ergodic hypothesis, were too narrow. In that case, 
however, it is held that the present theory will be a 
description of what is physically valid in a limit case, 
with the latter being a reasonable approximation to 
physical reality. 

The statistical method would also be applicable with 
fewer assumptions, but the end formulas would be 
much more involved than they already are. In order to 
show the workings of the proposed method, it has 
therefore been thought permissible to make those 
assumptions. 

In conclusion, the writer wishes to acknowledge his 
indebtedness to Imperial Oil Limited, Calgary, for the 
permission to publish this study. Dr. John v. Neumann 
of the Institute for Advanced Study in Princeton, New 
Jersey, read and critically commented upon the work 
presented here; the writer wishes to thank him. 
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New methods for determining the time fraction of coincidence of separate periodic pulse trains are con- 
sidered. The analysis applies solely to pulse trains whose fundamental periods are commensurable. For two 
pulse trains, the coincidence fraction is easily determined. For three or more pulse trains, a method for 
determining the largest possible error resulting from direct use of elementary probability theory is developed. 
Hence the question: “When does probability theory provide a close approximation to the coincidence 


fraction?” is to some degree answered. 


I. INTRODUCTION 


N a paper’ published in this journal, K. S. Miller and 
R. J. Schwarz stated and proved certain funda- 
mental results in the theory of pulse train interference 
or coincidence. The purpose of this paper is to present 
further results in this theory. Some of these results 
make use of probability theory. 

The coincidence problem might arise in communica- 
tions—in the interference of signal pulse trains, for 
example. However, the theory is also applicable to a 
large variety of manufacturing problems, involving 
simultaneous operation of similar machines or conveyors 
of different periodicities. For instance, if one person is 
asked to oversee a number of machines, each of which 
periodically requires his attention, we might ask: How 


1K. S. Miller and R. J. Schwarz, J. Appl. Phys. 24, 1032-1036 
(1953). 


often will two or more machines simultaneously require 
attention? 

Pulse train is abbreviated PT. By a simple pulse train 
(SPT) we shall mean a PT having exactly one pulse 
per period. Figure 1 illustrates two SPT’s, of periods 
T, and T, and pulse widths p; and po, respectively, and 
an initial phase s. By a general pulse train (GPT) we 
shall mean a PT having one or more pulses per period. 


' 
P, — 





























Fic. 1. Two simple pulse trains. 
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Fic. 2. A general pulse train. 
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(Every SPT is a GPT.) These pulses need not have 
equal width (see Fig. 2). 

The work of the Miller-Schwarz paper is assumed for 
the demonstration of the following: 


(a) A new method for solving the coincidence prob- 
lem for two SPT’s. This is a considerable simplification 
of the Miller-Schwarz method. 


(b) The coincidence fraction for two GPT’s can 


usually be approximated very simply by probability 
theory. An upper bound for the error is easily obtained. 

(c) Using the above, the solution for three and four 
SPT’s can be easily approximated. The method can be 
generalized to include five or more SPT’s. 


Il. COINCIDENCE OF TWO SPT’S 


Consider the two SPT’s of, Fig. 1. We shall assume, 
with Miller and Schwarz, that our analysis applies solely 
lo pulse trains whose fundamental periods are commen- 
surable. That is, we assume that all values are integral 
multiples of some number p: 


pi=up T\=mp 
po=vp T.,=np 
s=rp. 


(As far as possible, the author has retained the Miller- 
Schwarz notation.) 

The Miller-Schwarz method can be restated as fol- 
lows: Let g=gcd(m,n) and M=Icm(m,n). We form all 
possible differences x.—*,, where 


x,=1, 2, seep 
xXo=7+1,7r+2, ---, r+. 


Let N be the number of these differences which are 
divisible by gy. Then PT 1 and PT 2 will coincide the 
fraction 
N Ng Ng 
=> = (1) 
M Mg mn 
(The last equality results from the fact that mn= 
gcd(m,n) Xlcm(m,n).) 
The new method? follows, first for r=0 (zero lag at 
some point), then for the general case. 
Let r=0. There exist integers ay, a2 such that 
w=agtn O<n<g (2) 
v=dogt+Tre2 O<re<g. 


2 The referee has pointed out that the coincidence fraction of 
two pulse trains can be computed alternatively in a direct way 
from the cross-correlation function of the two trains. [For some 
general results, see D. Middleton, J. Appl. Phys. 22, 1153 (1951), 
especially Sec. 4. ] 
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Let finax=Max(P1,72), Tmin=Min(7r1,7%2). Let f* denote the 
coincidence ratio for r=k. 
Theorem 1.— 


fo BUF min (g—? max) 


mn 


Proof.—We will count the number .V of differences 
x¥2—.x, which are divisible by g. Table I sets a pattern 
which we can follow. Row by row, we associate with 
each x, all x; such that g divides x.—.: 

We count the differences as follows: For x.=1,2, ---,g 
there are 

d=r;(a,+1)+(g—r:)d, differences (3) 


which reduces to 
d=r,+ga;=un. (4) 


The Table repeats itself every g rows. Hence for x2= 
1, 2, ---, dog there are dou differences. 
For x2=dog+1, dog+2, ---, dog+re there are 


(re(atl)=nartre if re<r; 


d’= (5) 
leilect1)-+ Ge—r Je = rer: if 


ns 
or in either case 
d’=rody tin. (6) 
Hence all values of x2 give 
N=@qyt+reditr min. (7) 
If we multiply both sides of (7) by g, and make use of 
(2), we obtain 
Ng p+ Tain Fuan) 
p=—= (8) 


mn min 


which proves the theorem. 

If r~0, we proceed as follows: Let PT 2* and PT 2** 
be defined as in Fig. 3. We can then apply Theorem 1 
to PT 1 and PT 2%*, and obtain 


u(r+ v) +8 min* (g—f max’) . 
ee lnc canal (9) 


Similarly, we apply Theorem 1 to PT 1 and PT 2**, and 











TABLE I. 

- — —_  o- Total 
x2 x1 per row 
1 1, g+1, 2g+1, ---, aig+1 a,+1 
2 2, g+2, 2g¢+2, gprs aig+2 ai+1 
r ni, gtni, 2g+n, vee, aigt+ni a,+1 

n+l n+l, gtrnitl, ---, (ai—1)g+rit+l ay 
g g, 2g, 3g, «++, aug ay 

gti 1, g+1, 2g+1, ---, agt+l ai+1 
; etc. i 
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obtain 


lias er+fmin** (g—Tmax"’) 


min 





It is easily seen that f*= /*— /** gives the coincidence 
ratio for the r~0 problem. Hence we have proven 
Theorem 2.— 


py+ fom” (g— Sint” es hon (g— ay 


‘ ————— 
where 
r;*=r,**= remainder of u/g 
ro*= remainder of (r+v)/g 
r** = remainder of r/g 
Tmin* = min(r*, 72*) 
Tmax” = max(r,*, ro”) 
Tmin** = min(r;**, ro**) 
tan = max (7,**, ro**) 7 


Corollaries.—W henever r=0 and one (or both) of w, v 
is divisible by g, we have rmin=O and 


pv 
f=—- (11) 


mn 


A similar conclusion may be drawn in the r¥0 case. 
If g=1, then since all remainders must be less than g, 
we have fmin=fmin* =fmin** =O and 


wv 
f’=— forallr. (12) 


mn 


This result can be very useful. In a given pulse train 
experiment, it may be possible to choose period lengths 
which are prime to each other, and this would permit 
simple computations. 


III. COINCIDENCE OF TWO SPT’S— 
THE PROBABILITY SOLUTION 


The formula of Theorem 2 can be rewritten 





MV 
fra—te 
min 
(13) 
Puna” (g— Cun”? — tei (g— ey 
¢= ———_ 
min 


uv/mn (or pip2/T1T:2) is easy to obtain; we might call 

it the natural probability solution to the problem. 

Under what conditions is this a good solution? 
Theorem 3.—For two SPT’s 


pv 
f'=—te 
mn 


O<e<g?/4mn 


where g=gcd(m,n). 
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Fic. 3. Decomposition of PT 2. 
Proof — 
0< min(g— Tuan) a (g—Trmax)- (14) 
If g is even, 
Tmax (S—P max) <g/2(g—g/2) =g?/4. (15) 


If g is odd, 


g+1 gt+1\ g—-1 
T max (¢— aus) S=—(s-“—) = (16) 
2 ? 4 





In either case, we may say that 


0< Tmin(E—Tmax) <2"/4. (17) 
Hence we have 
pv O0< e*¥<g?/4mn 
f'=—+ (P—e”) (18) 
mn O< e**< g?/4mn. 


The maximum error is not increased, but is merely 
allowed to be negative: 


uv 
f'=—ste 
mn 


O<e<g?/4mn QED. (19) 


We now have a condition under which the probability 
solution works: if gK2(uv)', the approximation is usable. 
The main importance of the above solution, however, 
lies in its application to the work below. 


IV. COINCIDENCE OF TWO GPT’S 


We are given the general pulse trains GPT 1 and 
GPT 2, of periods 7, and T2, respectively. These trains 
have k, and k» pulses per period, of widths 1, te, - + +, #k1 
and 2, v2, ---, tke (see Fig. 4). We assume that all pulse 
widths «; and 2;, and all intervals c; and d; are integral 
multiples of some number p. (Again we require that 
all values be commensurable.) Let 


| 
j 
j 
1 
i 
; 
| 
. 
: 


ky ko T\= mp 
A=Lui pe=L7; 
tl i=l T:= np. 
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Then p; and 2 are also multiples of p: pi=up, po= vp. 
Let f be the coincidence ratio for the two trains. 
Theorem 4.—For two GPT’s 


9 


bv g 
f=—ste e< ky ko— —* 
mn 4mn 
Proof. —Let u;= pip, vj= vip, so that 
a oe Em) p= up 
i=] 


or 


h= Sus, 


t=1 
and similarly 
ke 


v=)ov;. 
j=l 


Consider the ith pulse of GPT 1 and the jth pulse of 
GPT 2. From Theorem 3, we know that the coincidence 
ratio for these two pulses is 


Mil; g° 
——- ty 0<e,;<— = (20) 
mn 4imn 


For fixed 7, summation over j gives the coincidence 
ratio for the ith pulse of GPT 1 and all pulses of GPT 2. 
Summation of the latter over i then gives the desired 
coincidence ratio: 











ky ke (yi; pv At ke 
f=rc Ld viii y(+e,). (21) 
i=. =i \mn mn i=. j=1 
But 
| ky ke 
DL U(+«is)|< (22) 
i=l j=l 4mn 
so that 
bv g° 
f=—s+e O<e<kikx— QED. (23) 
mn 4mn 


There is a more exact solution than the foregoing 
probability solution. First of all, when g=1 then (12) 
applies to GPT’s also, and e=0. For g¥1, it is possible 
to use Theorem 2 in a generalized form and thus obtain 
an exact answer. However, the computations will in 
general be tedious. 


“_.. 7 ~ 
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Fic. 4. Two general pulse trains. 
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V. THE PRODUCT OF TWO PULSE TRAINS 


Any two trains GPT 1 and GPT 2 determine a third 
train GPT 12 in an obvious way. GPT 12 is the train 
whose pulses occur whenever GPT 1 and GPT 2 coin- 
cide (see Fig. 5). GPT 12 is called the product of GPT 1 
and GPT 2. 

Of special interest is the case where the two given 
trains are SPT’s. We are given trains SPT 1 and SPT 2 
(Fig. 1) and we form the product train GPT 12. GPT 12 
will have ky pulses per period, of widths 2), #2, - + -, wk. 
Let 


kyo T,=mp g=gcd(m,n) 
pis= Dimi Pie=Ki2p 
it T2=np M=Icm(m,n). 


Let f° be the coincidence ratio for SPT 1 and SPT 2. 
(What follows is independent of initial phase relation- 
ships.) 

Theorem 5.—py.= f’-M. 

Proof.—We know that 


fr=— (1) 


It is easy to see that Vp and pi2= 2p are both measures 
of the amount of coincidence of SPT 1 and SPT 2 in 
the common period Mp. (Mp is also the period of 
GPT 12.) Hence Vp=yy2p and 


wi2=N=f"-M QED. (24) 


We will also need a value for k,2, the number of pulses 
per period of GPT 12. We shall obtain an estimate for 
this value, first when r=s=0 (or when SPT 1 and 
SPT 2 are in phase at some point).. 

Theorem 6.—Let r=s=0. Let a, and a» be defined for 
SPT 1 and SPT 2 as in (2). Then 


4+ @2—1<ky<a;t+a2+1. 


Proof.—Each pulse of GPT 12 is formed by the over- 
lapping of pulses from SPT 1 and SPT 2. In every case, 
the left end of the overlap is determined by the left end 
of one, or both, of the SPT 1 and SPT 2 pulses (see 
Fig. 5). 

In our integer model of two SPT’s (Table I) this 
means that 


kyo=m+n—1 (25) 


where ;= the number of occurrences of x;= 1 in Table I. 
For Table I is essentially a description of all units of 
coincidence of SPT 1 with SPT 2. Each occurrence of 

x,= 1 can be interpreted as an overlap in which the left 
end of the SPT 1 pulse participates. Similarly for x.=1 
and SPT 2. We have counted x,=x.=1 twice; hence 
we subtract 1 from 2;+72. 
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Fic. 5. GPT 12=GPT 1XGPT 2. 


a,+1 O<r 
no= | (26) 
ay, O=r; 
and by symmetry 
ao+1 O<re 
ny= (27) 
a2 O=fro. 
Combining with (25) we have 
Q,+a2—1<kyo<a;+a.4+1 QED. (28) 


Corollary.—If s¥0O then a,+a2—1<ky2<a,+a24+2. 
The proof is similar to the proof of the above theorem, 
except that (25) is replaced by 


ny+n2—1<Ryo< y+. (25’) 


For if s#0 we would have m,=number of occurrences 
of x:=1, m2=number of occurrences of x2.=r+1. If g 
divides r, then there is a duplication as above, and we 
must subtract 1 from +2; if g fails to divide r, there 
is no duplication, and ky2= 1+ M2. 


VI. COINCIDENCE OF 3 OR 4 SPT’S 


We are now in a position to solve the coincidence 
problem for three or four SPT’s. There is, first of all, 
an approximate solution with maximum error estimate. 
For the three-train problem, we proceed as follows: 
first we form the product of two of the trains. Using 
Theorem 6, we can find an upper bound for the number 
of pulses per period of the product train. By Theorem 5 
(with Theorem 1 or 2) we can find the sum of the widths 
of all pulses of the product train. Finally, we use 
Theorem 4 on both the product train and the third SPT. 
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There are three ways to apply this procedure, de- 
pending on which pair of SPT’s form the product train. 
There will be three final coincidence ratios and three 
error estimates, usually unequal. We need not compute 
all three answers, however, since the error estimates 
can easily be computed first, and the approach giving 
the least error chosen. 

The four-train problem is approached in a similar 
manner. We have two pairs of SPT’s, and we form the 
product of each pair. Theorem 4 is then applied to the 
two product trains. There will again be three ap- 
proaches, and the least error approach can be chosen as 
before. 

The maximum error estimate resulting from this 
method may be impractically large. When this happens, 
it is usually possible to arrive at an exact answer, but 
with a corresponding increase in computation. For the 
four-train problem, the exact method would proceed as 
follows: Using the Miller-Schwarz methods, we can 
obtain an exact description of the product train of two 
SPT’s. That is, we can determine exactly where each 
pulse of the product train begins and ends. Hence, we 
can reduce the four SPT’s to two completely described 
GPT’s. The final comment of Sec. IV then applies: we 
can generalize Theorem 2 in exactly the same manner 
as Theorem 3 was generalized to Theorem 4. The 
answer thus obtained will be exact. 


VII. COINCIDENCE OF 5 OR MORE SPT’S 


The methods described above can be generalized to 
include the coincidence problem for any number of 
SPT’s. It is desirable, for simplicity of computation, to 
use Theorem 4 (probability solution) as much as pos- 
sible. Methods of solution can generally be designed so 
that pulse train pairs with large g occur in the earlier 
application of Theorems 1 and 2, and pairs having 
small g in the later probability application. In this way, 
the final error estimate is usually minimized. If the 
error is still too large, however, then exact methods 
must be used beyond the initial formation of produce 
trains. The remarks at the end of Sec. VI then apply. 

The author is indebted to Mr. H. A. Samulon, Mr. 
K. Fong, and Mr. P. Weiss of the Electronics Labora- 
tory, General Electric Company, for their advice and 
encouragement. 
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The Joint Distribution of n Successive Outputs of a Linear Detector* 


Wituiam C. HorrMant 
University of California, Les Angeles, California 
(Received November 18, 1953) 


The joint probability density function of n successive outputs of a linear detector is derived for the case 


of dependent Gaussian inputs. 





1, INTRODUCTION 


HAT a random noise input to a linear detector 
yields an output which is Rayleigh distributed is 

well known.' The independence of random noise from 
pulse to pulse permits us to write the joint probability 


density of m such outputs Rj, Ro, ---, R,' as 
)=11 = ex( z ) (1.1) 
11, °**,%n)=[] —exp{ —— }- é 
J i=l go? 20? 


However, there exist cases, like “‘clutter” in a radar 
system, in which successive pulses are dependent, so 
that the joint probability density cannot be written in 
the relatively simple form (1.1). It is the purpose of 
this paper to derive, under the assumption of Gaussian 
inputs, the probability density function (hereafter ab- 
breviated to p.d.f.) which applies in such cases. 

The input to the linear detector is assumed to be the 
discrete-parameter Gaussian process (Xj, Y;) (j=---, 
—1,0,1,---). In electronics X; and Y; are termed “‘in- 
phase” and “‘in-quadrature” components, respectively, 
and may be thought of as real and imaginary parts of 
the complex Gaussian process (Z;)=(Xj+iY;). The 
output is then of the form (under certain restrictions 
on the intermediate-frequency band width) 


Rj= (X?7+Y/)}, (j=---,—1,0,1,---) (1.2) 


and it is clear that (Rj) represents the modulus of the 
complex (Z;) process. 


2. DEFINITION AND PROPERTIES OF THE 
GAUSSIAN PROCESS 


The (X;,Y;) process will be supposed stationary, i.e., 
the covariance function of (X;,Y;) and (X,,Y;,) does not 
depend explicitly on j and k but only upon the differ- 
ence | j—k|. It follows that the R; process is also sta- 
tionary in the wide sense. 


* A portion of a thesis submitted in partial satisfaction of the 
requirements for the degree Doctor of Philosophy. It is a pleasure 
for the author to express his gratitude to Professor Paul G. Hoel 
for his aid and encouragement in this work. 

¢ Author now at U.S. Navy Electronics Laboratory, San Diego 
52, California. 

1J. L. Lawson and G. E. Uhlenbeck, Threshold Signals (Mc- 
Graw-Hill Book Company, Inc., New York, 1950). 

2S. O. Rice, Bell System Tech. J. 23, 282-332 (1944); 24, 
46-156 (1945). 

* Throughout this paper capital letters will be used to denote 
random variables, and corresponding lower case letters wil] denote 
the associated real variables in the probability density function. 


The first- and second-order moments of the Gaussian 
process are then defined as follows: 


EX ;= EY;=0, . (2.1) 
(j=-++,—1,0,1,- ++) 
EX7= EY 7=y, (2.2 
E(Xj;X,)= E(VjV2)=Vpij-«;,  j¥R (2.3) 
(j,k= nic --~ 1,81,- : -) 
ee aces vp" \j—-K1; 
E(X;) = —E(X xn} j= : (2.4) 
0, j#k 


Let A,, denote the covariance matrix of the 2” random 
variables X,, Yi, ---, Xn, Yn. According to the defini- 
tions (2.1)—(2.4), A, must have the form 


A,=vP,, ( 


i) 
on 
~_—* 


where P,,, is the associated correlation matrix. 

In the derivation of the joint distribution of R,,---,R,, 
properties like (2.3) and (2.4) are required for the ele- 
ments of the inverse correlation matrix P,,~'. The fact 
that these properties do hold is the content of the 
following lemma. 

Lemma 2.1. Let the matrix P, be nonsingular and 
defined by (2.3) and (2.4). Then the elements of P,~ 
are such that 


pu-ltk-le — pritk (2.6) 
pu-btk— —pi2k-1 (7. k=1,2,-++,2) (2.7) 
pri-h2k = — p7/.2k-1=() for j=. (2.8) 


Proof. It is clear from (2.3) and (2.4) that the follow- 
ing relations hold for the elements of P,,,: 


P2j-1,2k-1= P2j,2k 


(j,k=1,2,--+,n) 


P2j—1, 2k= — P2j, 2k—1 
P21, 2k= pr, 2%-1=0 (k=1,2,--+,n). 


Furthermore, P,, is symmetric and nonsingular. There- 
fore, its complex isomorph,‘ C,,= (p2)~1, 2x1 ip2j—1, 2x) is 
Hermitian and non-singular. Consequently a unique 
C,,— exists and is also Hermitian, so that all its diagonal 
elements are real. By the same isomorphism as between 
P,, and C,, there is a one-to-one correspondence between 


‘Garrett Birkhoff and S. MacLane, Modern Algebra (Mac- 
millan Company, New York, 1947), p. 217. 
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JOINT DISTRIBUTION OF n SUCCESSIVE OUTPUTS 


C,.-1= (p24! 24-1- jp?-1:2k) and the 2mX2n real matrix 
2j7—1,2k—1 27—1,2k 
p p 
p=| (7 mo) 
— pi! 2k p- »2k—1 
Since products are preserved under isomorphism, it follows from C,C,-'=I that P,,P,-'=TI. Inspection of the 
above form for P,,-' then establishes the lemma. 


3. THE JOINT DISTRIBUTION OF R,, R2, ..., Rn 


The bivariate distribution of R;, Ry is well known.'? In the general n-dimensional case we start with the multi- 
variate normal distribution of X;, Vy, ---, Xn, Yn, which in view of Lemma 2.1 has the form 


1 nn 
f(%1,%1, oe XnyVn) = (2r)-"y-"|P,,.|—3 exp Oy ) [p?t-t.8-2 (x je+ Vive) + pt * (xy.— xKY;) ] . (3.1) 


y 7,k=1 


After the change of variables «;=1; cos@;, yj=1r; sin; (j=1,2,---,n), the quadratic form in the exponent of (3.1) 
becomes 


Dp?! * In 7, CoS(04—8;) +p **r 7, sin (0,—8;) |= ze, 2+-2>°B'*r 7. cos(0.—0;— o**), 


i,k i<k 
where 8’* and g** are defined as follows: 


pri-t.2k-1 = Bik cosg’*, 
2j—-1,2k Bik ‘ ik (j,k=1,2,- *+\m). 
"ll ain sing*", 


So that B*=6*!, g/*= — g*?, g/i=0. Consequently the desired marginal p.d.f. of Ri, Re, ---, Rn follows, in the form 


forage *yra) = 2m)" Pal TTP; exp(- Sar) [- of “ew| -, Ep cos(0.—0,— 0) [flan (3.2) 


j=1 i<k 


Now consider the multiple integral appearing in (3.2). Since the integrand is continuous, the multiple integral 
may be written as an n-fold iterated integral. Then induction on m, making successive applications of the formula® 


e772 cose — _ (— 1)"J,(z)e'"?, 


n=—o 


to each iterated integral, and a final integration with respect to @,, permits the evaluation of the multiple integral. 
The p.d.f. of Ri, Re, ---, R,» then appears in the form 


n 1 n 
f(ni,re, = Tn) - y" | P,, | “YT; exp( sen?) 
j=l 2y 1 


2, 2 — Bi = 
x pm (-1yr"™ TI Lend (—rn) cos( Ene), (3.3) 
> vik= ZT vem j=l k>j y j=1 k>j 
k<m 
(k =1,2,--+,n) 
OS vik < oo 

where y and P,, are defined by (2.2) and (2.5), respectively; €,;, is the Neumann factor with value 1 for vj,.=0, 
and value 2 for v;,>0; I,(x) is the modified Bessel function of the first kind with imaginary argument; and 
Bit = [ (p21. 2-1) 24 (p?é—1.2k)2]h g’*=arc tan (p?#-1.28 [ fetse-*), (j,k= 1,2,° oft sn). 


5G. N. Watson, Bessel Functions (Macmillan and Company, Ltd., London, 1944), rev. ed., p. 22. 
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The gaseous expansion from an electric spark is described in this paper in terms of ideal gas theory. The 
assumption is made that a quantity of energy is injected into a small gas volume in an infinitesimally short 
time and that the initial volume of gas then expands until it reaches the pre-spark pressure at an elevated 
temperature. Experimental results are given for a condenser discharge illustrating the dependence of the 
pressure equilibrated gas volume upon initial pressure and condenser energy. These results are interpreted 
using the ideal gas model presented. The significance of this expansion to the problem of spark ignition in 


combustible gases is discussed. 





INTRODUCTION 


HE problem of spark ignition in a combustible 
mixture is complicated by the fact that a multi- 
tude of physical and chemical processes is involved. 
The result is that the contribution of the electric spark 
cannot be evaluated fully since the heat energy of com- 
bustion is being added continuously at an unknown rate 
to the thermal energy transmitted initially to the gas 
by the spark. Information as to the role played by the 
spark energy alone in creating a local gas domain at a 
high temperature is obviously necessary before complete 
understanding of the more complex phenomenon of 
spark ignition can be achieved. Thus while the condi- 
tions necessary for ignition have been documented for 
several experimental procedures, there yet remains the 
problem of understanding the expansion and energy 
requirements of the hot gas kernel which follows the 
spark discharge and which in a flammable gas mixture 
precedes the establishment of a steady state combustion 
wave. 

Rotating mirror measurements have been reported 
recently on the expanding channel of the spark by 
Craig and Craggs, Vul’fson and Libin, and Gegechkori. 
Craig and Craggs and Vul’fson and Libin! have studied 
the self-luminosity of the spark, while Gegechkori* has 
made measurements on schlieren photographs. These 
investigators have recorded the rate of expansion along 
the perpendicular to the direction of current flow for 
various values of spark circuit constants. Since the ex- 
panding gas possesses a rather strange symmetry, the 
volumetric expansion of the kernel cannot be deter- 
mined from this type of measurement. Olsen, Edmon- 
son, and Gayhart,* by photographing the kernel outline, 
have shown that the volumetric expansion approximates 
an adiabatic process from the time of shock separation 


* Work supported by Bureau of Ordnance, U. S. Navy, NOrd 
Contract 7386. 

'R. N. Craig and J. D. Craggs, Proc. Phys. Soc. (London), 
B66, 500-511 (1953); K. S. Vul’fson and I. S. Libin, J. Exptl. 
Theoret. Phys. (USSR) 21, 510-513 (1951). See J. M. Meek and 
J. D. Craggs, Electrical Breakdown of Gases (Oxford University 
Press, London, 1953), Chap. X for a complete discussion and 
bibliography. 

*,N. M. Gegechkori, J. Exptl. Theoret. Phys. (USSR) 21, 493- 
506 (1951). 
aes Edmonson, and Gayhart, J. Appl. Phys. 23, 1157-1163 


to the time of pressure equilibration within the kernel. 
While it cannot be inferred from post-shock behavior 
that the observed pre-shock-separation expansion pro- 
ceeds as an adiabatic process, there is little doubt that 
an appreciable fraction of the initial thermal energy of 
the spark is transferred from the kernel even though 
this process is not isentropic. 

The aim of this paper is to describe the hot gas kernel 
in terms of its time-dependent volume and its energy 
content, utilizing ideal gas theory and the experimen- 
tally determined geometrical dimensions of the kernel. 
The objective of the experimental work is to ascertain 
whether the idealized theory can account for the kernel 
behavior to a first approximation under various condi- 
tions. A determination is made of the dependence of 
the pressure-equilibrated volume Vy; upon condenser 
energy 1/2 CX? for an electrical system in which lead 
lengths external to the spark gap have been minimized 
and vacuum condensers have been used. In addition, 
the effect of the initial pressure upon the heated gas 
kernel has been determined for various gases. 


IDEALIZED MODEL OF GAS KERNEL BEHAVIOR 


In a consideration of the expansion of the spark 
kernel after shock wave separation, expressions have 
been derived for the energy content of the hot gas in 
terms of the initial gas conditions and certain measured 
volumetric quantities.* Using an adiabatic model for 
expansion, the internal energy of the kernel at the time 
of shock wave separation is given by Eq. (7) from 


reference 3 
Po VV; 
y—-1hV,7" 


where the subscripts 0, s, and f refer, respectively, to 
the time of the spark, shock wave separation, and pres- 
sure equilibration. While the above equation refers 
specifically to the time of shock wave separation, it 
can be interpreted somewhat more generally in that 
the assumptions limit the development only to the time 
of thermal equilibrium. Thus, if at time (¢) there is 
thermal equilibrium, this equation becomes 


Po [Vy 
E.= = vs| P (2) 
(y-1)LV 7" 
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If during the time for establishment of thermal equi- 
librium no appreciable expansion of the gases has oc- 
curred, then the thermal energy transferred from the 
spark to the gas can be written as 


Po 


Eg=——————__V s?— Vor 3 
oi ere J. (3) 


Introducing a thermal efficiency coefficient defined by 
Ey=n(1/2 CX?), then 
Po 


4CX?= ov 4 
: niy— Vern “ “ 





From Eq. (2) the energy content of the kernel at the 
time of pressure equilibration is immediately obtainable 
[Eq. (8), reference 3] as 


E,;= 





V;—Vo|. 5 
Ga rs s—Vo] (5) 


The percentage of the initial thermal energy remaining 
in the ignition nucleus after the initial expansion de- 
pends upon the amount of thermal energy initially 
transferred and the initial volume of gas affected and 
can be obtained immediately. From Eqs. (3) and (5) 


 o<% 
Ey BY—-1 





(6) 


where 8 is the volumetric expansion ratio V;/Vo. A plot 
of the ratio E;/Epo as a function of 8 is given in Fig. 1. 
Since the value of 8 decreases with a decrease in the 
initial heat content, the limiting value of the ratio E,/Eo 
as the initial energy is decreased is: 


E, g-1 1 
lim —=lim =—- (7) 
EO Ey 8-1 gy-1 





A significant point is that there is an upper limit (1/7) 
to that fraction of the initial energy which remains 
within the ignition nucleus after this type of process and 
that limit is considerably less than unity. 


ENERGY RATIO €,/E, « 100 














VOLUMETRIC EXPANSION RATIO B*¥, /V, 


Fic. 1. Variation in the energy ratio as a function of volumetric 
expansion ratio for y=5/3 and 7/5. 
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Fic. 2. Schematic diagram of the schlieren system showing the 
relative positions of the schlieren components. 


EXPERIMENTAL 


The primary data from which observations are made 
consist of time-controlled flash schlieren photographs 
of the ignition kernel subsequent to the time of pressure 
equilibration. Although the exact time interval from 
discharge to photographic exposure need not be known 
with great accuracy for this work, it is necessary that 
all schlieren photographs for a given series utilize the 
same delay interval so that individual photographs may 
be mutually and quantitatively compared. For the pur- 
pose of this experiment, twenty microseconds from 
spark to photograph is adopted as the standard time of 
observation. 


Optical and Timing Arrangement 


The schlieren system, Fig. 2, is a dioptric one utilizing 
2-inch diameter, 8.5-inch focal length lenses. As a knife 
edge, a square opaque spot has been used together with 
a square aperture defining a light source, thereby pro- 
ducing radial symmetry in the schlieren image.‘ The 
illumination source is a Libessart spark gap with about 
0.25 10-6 second effective duration. The time delay of 
twenty microseconds was introduced by an electronic 
interval timer triggered by electromagnetic radiation 
from the ignition spark gap.® 


Gas Cell 


The gas to be studied was placed in a closed cell 
which is shown by the line drawing and photograph of 
Fig. 3. Through the bottom of the cell passes an insu- 
lated metal rod into which is soldered one of the spark 
electrodes. This electrode is fixed in position with re- 
spect to the cell. Through the top of the cell passes the 
other (noninsulated) electrode holder. This holder has 
been provided with an expansion bellows and a differ- 
ential screw which allows precise setting and alignment 
of the movable electrode with respect to the insulated 
one. At the ends of the cell are two glass windows with 
a diameter of 5 cm and a thickness of 6.4 mm, through 
which the schlieren beam is passed. At 90° to this beam 
axis, an observation window is available for electrode 

‘ Edmonson, Gayhart, and Olsen, J. Opt. Soc. Am. 42, 984 
(1952). 


5 Edmonson, Gayhart, and Olsen, J. Soc. Phot. Engrs. 3, 3, 
135-144 (1952). 
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Fic. 3. Line drawing of the closed gas cell—inset—photograph 
of the cell completely assembled. 
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_ Fic. 4. Schematic diagram of the ignition spark and illumina- 
tion arrangements. (a) Ignition spark—minimized circuit. (b) Illu- 
mination circuit. (c) Ignition spark—thyratron—gap circuit. 
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alignment purposes. The gap electrodes were aligned 
and the gap length measured to within 0.0001 inch 
using a Jones and Lamson optical comparator. On the 
side opposite the observation window is a double bellows 
arrangement which carries a rod of known diameter and 
an electrode stroking pin. The pin allows cleaning of 
the electrodes, a procedure which is occasionally neces- 
sary for the continued satisfactory operation of the 
spark. The rod provides a convenient means for deter- 
mining the magnification of the schlieren system for 
each series of photographs, nominally 3.4. In some cases 
the stroking pin is replaced with a polonium ionization 
sourcey for stabilizing the gap breakdown voltage. The 
cell is mounted so that the axis of the electrodes, plated 
brass pins of 0.64-mm shank diameter, is vertical. 

A gas supply, a mercury manometer, and a vacuum 
pump are connected to the gas cell. The argon, oxy- 
gen, and neon were from cylinders with a purity of 
99.6 percent, 99.5 percent, and 99.9 percent, respec- 
tively. On introducing the test gas to the cell the gas 
chamber was repeatedly evacuated and pressurized until 
the dilution was negligible. The pressure was then ad- 
justed to that desired for the particular experiment. 


Electrical Methods 


For the determination of the relationships between 
the condenser energy and equilibrated volume, a mini- 
mized circuit was employed. The term, minimized cir- 
cuit, is used here to mean one for which the energy 
losses external to the spark gap have been reduced as 
much as is possible with the experimental arrangement. 
There is no control element in series with the ignition 
gap, and the electrical leads and connections are as 
energy loss free as the experimental arrangement will 
allow. Variable vacuum condensers were used for energy 
control inasmuch as the breakdown voltage for a par- 
ticular gas should be determined only by the gas pres- 
sure and the gap length. This voltage was measured for 
each spark with a Sensitive Research electrostatic volt- 
meter. The capacity of the condenser was measured 
in situ with a General Radio capacity bridge, Type 
716-B. The electrical arrangement is shown diagram- 
matically in Fig. 4(a). The gap length was arbitrarily 
set at 1.38 mm. 

For the determination of the dependence of the spark 
kernel volume upon type of gas and the initial pressure 
the measurements at constant condenser energy could 
not be made under constant voltage, constant-capacity 
conditions in a minimized circuit in view of Paschen’s 
Law for spark gap breakdown. However, there is no 
necessity for high efficiency of energy transfer for deter- 
mination of this dependence. Since it was convenient, 
a hydrogen thyratron (Type 5C22) was used as a spark 
‘breakdown control element in series with the gap. This 

t The polonium ionization source was kindly supplied by the 


Canadian Radium and Uranium Corporation, 630 Fifth Ave., 
New York, New York. 
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thyratron is relatively free of time jitter and is capable 
of passing high transient currents.*? In this case, the 
ignition system was as shown in Fig. 4(c). The illumi- 
nation arrangement, as shown in Fig. 4(b), remained 
the same for both experiments. Rather than initiation 
by the statistical breakdown of the ignition gap, the 
spark is initiated by ‘“‘overvolting”’ the gap by triggering 
the hydrogen thyratron in that circuit. This spark dis- 
charge for the 0.01 uf pyranol condenser in the above 
arrangement has been found to have a duration of less 
than 0.5 microsecond. 


RESULTS 


The results of the experimental work may be classified 
into two general categories: those resulting from experi- 
ments performed in a minimized system to determine 
the effect of condenser energy upon the kernel volume 
and those resulting from experiments in the nonmini- 








1S 1.06 2.35 3.65 5.09 


Argon 
1640 mm Hg 





.98 3.05 9.59 15.04 21.05 
Oxygen 
795 mm Hg 


Fic. 5. Schlieren photographs of the pressure equilibrated gas 
kernel in argon and oxygen. Vacuum condensers—minimized 
circuit—twenty microseconds from spark to photograph—milli- 
joules energy given under individual photographs. 


mized system for the determination of the effect of the 
initial pressure upon the kernel volume. 


The Effect of Condenser Energy on Kernel Volume 


This series of measurements was carried out for argon 
and oxygen at a pressure of 795 mm mercury absolute 
and for argon at 1640 mm mercury absolute. Typicat 
schlieren photographs are shown in Fig. 5. The kernels 
were reproducible and of regular outline. The results of 
the measurements are given in Figs. 6 and 7, in which 
the measured pressure-equilibrated volume has been 
plotted as ordinate against the vacuum condenser energy 
(1/2 CX*) as abscissa. In the case of argon, a supple- 
mentary polonium ionization source was used with the 
result that each argon series was taken under constant 





5W. T. Whelan, Trans. Am. Inst. Elec. Engrs. 67, Part IT, 
1303-1307 (1948). 

7K. J. Germeshausen, MIT Radiation Laboratory Series 
(McGraw-Hill Book Company, Inc., New York, 1948), Vol. 5, 


‘ p. 33S. 
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EQUILIBRATED VOLUME 





a 2 4 6 42 wo 2.0 4.0 60 40 m0 
CONDENSER ENERGY - MILLIVOULES - 4 Cv" 


Fic. 6. Plot of equilibrated kernel volume as a function of con- 
denser energy in a minimized circuit. Argon at initial pressure of 
1640 and 795 mm Hg absolute. 


voltage conditions. The breakdown voltage did vary 
considerably from spark to spark for oxygen since sup- 
plementary ionization was not provided. The data 
suggest a power relationship between the condenser 
energy and the equilibrated volume as follows: 


For argon, both pressures 1/2 CX*«V,'*; (8) 


For oxygen 1/2 CX?*« V4", (9) 


The Effect of Initial Pressure on Kernel Volume 


This series, for which the plotted data are given in 
Fig. 8, utilized neon, argon, and oxygen. The systematic 


EQUILIBRATED VOLUME VV, ( fimh*) 





4 2 4 420 20 2 6 1000 
CONDENSER ENERGY MILLIJOULES 
Fic. 7. Plot of equilibrated kernel volume as a function of con- 


denser energy in a minimized circuit. Oxygen at an initial pressure 
of 795 mm Hg. 
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ABSOLUTE INTIAL PRESSURE 


R (inches Hg) 


Fic. 8. Plot of equilibrated volume as a function of pressure. 
Constant energy—approximately 12 millijoules—delivered to 
spark gap. 


dependence of the volume upon the initial pressure is 
shown. Straight lines with’a negative slope of unity 
indicating the dependence V;« (1/Po) have been in- 
serted for each gas. These data, as previously men- 
tioned, were not taken with a minimized system in that 
a 5C22 hydrogen thyratron was used in series with the 
gap and a pyranol condenser was used. However, the 
argon data in the minimized system for the two pres- 
sures do provide a kernel volume vs initial pressure 
check measurement. For this pressure ratio of 2.06, the 
inverse kernel volume ratio is 1.90 which is in con- 
formity with the foregoing relationship. From a com- 
parison of kernel sizes for the minimized and thyratron 
systems it can be seen that the latter system was 
producing an equilibrated kernel equivalent to that 
produced by a minimized system with a 1/2 CX? of 
approximately 12-mj. 


DISCUSSION OF RESULTS 


It is possible to make an interpretation of the experi- 
mental results for the monatomic gases which is con- 
sistent with ideal gas theory. Equation (4) expresses a 
theoretical relationship between the equilibrated kernel 
volume and the initial condenser energy for an efficiency 
transfer coefficient 7. In this expression, it should be 
kept in mind that Vo, 7, and y all may, in general, vary 
in some manner depending upon the spark energy and 
the gas involved. Roth, Guest, von Elbe, and Lewis® 
have measured the energy release at a spark gap for 
monatomic gases for spark energies of the order of 1.0 
millijoule in a minimized system and have concluded 
that it is approximately 0.95 (1/2 CX*). While it does 
not follow necessarily that all of the energy dissipated 


8 Roth, Guest, von Elbe, and Lewis, J. Chem. Phys. 19, 1530- 
1535 (1951). 
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at the gap is effective immediately in establishing a 
temperature in the gas kernel, for lack of specific infor- 
mation on this point the assumption is made here that 
the 95 percent energy transfer is valid. Using this value 
for n and assuming that 7 does not vary appreciably for 
a monatomic gas, Eq. (4) can be solved for the initial 
volume Vo. With this result it now becomes necessary 
to consider whether the experimental results of Eq. (8) 
can be interpreted in a manner consistent with the 
theoretical Eq. (4). This equation may be written 


PoVo Vy Y 
cx-——] (~) -1} (10) 
n(y—1) Vo 


If [V,;/Vo]* is large as compared to one, then 


tom 


PV) 0-1 
[Vi], (11) 


n(y—1)Vor 


wie 


CX*= 


Thus it is seen from the comparison with the experi- 
mental result [Eq. (8)] that if experiment and this 
theory are to describe the same physical phenomenon, 
then V; and Vo must vary in such a way that the coeffi- 
cient of V,;'* in Eq. (11) for a given pressure remains 
independent of energy. Thus the dependence of V» upon 
condenser energy is given by Vox (CX?)}. 

Typical values for the initial volume Vo, the expan- 
sion ratio 8, and the equilibrated energy-initial energy 
ratio E;/Eo are tabulated in Table I for an initial pres- 
sure of 795 mm Hg. Here, as computed from Eq. (6), 
the ratio E,/E» varies according to the spark energy 
injected to the gas. In all cases, however, it should be 
noted that a major fraction of the initial energy has 
been utilized in the expansion function. These results 
are presented for monatomic gases only, for while oxygen 
satisfies the same type of power law, lack of knowledge 
of the dependence upon the condenser energy of the 
specific heat ratio y and the fraction 7 of the electrical 
energy which appears as thermal energy instantaneously 
(as far as the expansion is concerned) makes it fruitless 
to follow through a parallel analysis for diatomic gases. 

When 1/2 CX? is set as a constant of a system, it 
follows from Eq. (4) that the right member of this 
equation must be independent of the initial pressure of 
the gas. A sufficient condition that this be true is that 


Taste I. Typical values for initial volume, expansion ratio, and 
equilibrated energy-initial energy ratio at an 
initial pressure of 795 mm Hg. 











Condenser Equilibrated Initial Expansion Energy 
energy volume volume ratio ratio 
(mj) V;(mm?) Vo(mm*) E;/Eo 
0.1 0.215 0.042 5.12 0.289 
0.2 0.375 0.061 6.15 0.263 
0.4 0.67 0.094 7.13 0.242 
1.0 1.43 0.160 8.94 0.213 
2.0 2.52 0.235 10.72 0.191 
4.0 4.45 0.348 12.79 0.171 
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TaBLeE II. 
Vol. of gas 
Affected 
Gap Condenser mm? 
Investi- width type and Condenser initial 
gator mm Gas range energy conditions 
Coward 1.00 8.4% Air About 0.90 
Meiter Methane- 600 uwuf = =—3.76 mj* 
air at 752 
mm Hg 
Edmonson 1.38 Argon at Vacuum 2.6 mj 0.31 
Olsen 795 mm 2123 ppf 
Gayhart Hg 
Argon at 3.76mj 0.36 
752 mm 


Hg 








* Coward and Meiter do not give explicit values of voltage and capacity 
or condenser energy for their experiments. They do, however, quote 
Morgan's value of 0.0009 cal (3.76 millijoules) for the least igniting energy 
of a condenser spark in an 8.8 percent methane-air mixture and imply that 
their spark energy was little different from this. __ 

>’ Vo calculated by extrapolation of the 795 mm Hg pressure line of 
Fig. 6 and application of a pressure correction of 795/752. 


each term be independent. Thus if in the subtractive 
term 


PoVo 


n(y—1) 


n and ¥ are not dependent upon pressure, Vy (1/P»). 
This result together with the term 


PW; 
n(y—1)Vor 


predicts a dependence of V;« (1/Po). This is the ex- 
perimental result which has been obtained and is illus- 
trated in Fig. 8 for both monatomic gases and oxygen. 

Evidently independent data do not exist at this time 
which would serve as a check upon the reliability of the 
calculation of the initial volume of gas affected by the 
spark. One point of interest, however, is a comparison 
of the volume of argon initially affected by the spark 
with that volume of a methane-air mixture consumed 
by a sub-critical ignition spark as measured by Coward 
and Meiter.® Since there is some combustible consumed 
after the cessation of current flow in the spark, even 
though a sustained combustion wave is not established, 
it is to be expected that the total volume of gas affected 


*H. F. Coward and E. G. Meiter, J. Am. Chem. Soc. 49, 396- 
409 (1927). 
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will be greater than for a non-combustible gas. The 
experimental conditions and results for the two experi- 
ments are summarized in Table II. The two results are 
consistent and seem to be in reasonable agreement, 
particularly when the essential differences in the quan- 
tities measured are considered. 


CONCLUSIONS 


Although the experimental work discussed has covered 
but a small number of the experimental variations pos- 
sible, the various analyses, correlations, and computa- 
tions with the experimental data have produced results 
which appear to be in conformity with the theoretical 
concepts based upon thermal equilibrium in an ideal 
gas. Therefore, assuming the sufficiency of application 
of the ideal gas theory to the processes in monatomic 
gases, the spark kernel energy can be derived from a 
knowledge of the pressure equilibrated volumes. For 
gases other than the monatomic ones, the lack of knowl- 
edge as to the specific heat ratio and dissociation at the 
temperature involved makes a similar discussion unjus- 
tifiable. However, the fact that the expansion process 
quickly removes an appreciable amount of energy from 
the spark kernel in a monatomic gas, projects the possi- 
bility that a parallelism exists in the ignition process. 
While the expansion may be considered as a funda- 
mental process necessary for ignition of a gas by an 
electric spark, the energy of expansion, once lost from 
the gas kernel, cannot further participate in the mecha- 
nism of propagation of a combustion wave. It then 
follows that the electric spark, in effecting ignition in a 
flammable gas mixture, affects a localized mass of gas 
whose behavior is thereafter transient in character, and 
the energy transferred initially is employed in part 
usefully in performing its ignition function and in part 
uselessly in the regions of the gas mixture beyond that 
from which the flame originates. 
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It is shown that fine precipitated particles present in single crystals of Fe:NiAl are elongated and that 
the shape anisotropy of these particles is a major factor in determining the coercive force of Fe2NiAl. For 
fields of less than 7000 oersteds, (111) directions are the directions of easy magnetization in the (110) plane 
of the crystal; this is due to the effect of the shape of the precipitated particles, which are oriented in three 
mutually perpendicular directions. In higher fields, the torque due to this effect decreases and then the 
[001] direction is the direction of easy magnetization because of intrinsic crystal anisotropy. These con- 
clusions are based on an analysis, and on measurements, of the torque on three mutually perpendicular rods 
of magnetic material. These rods simulate the preciptated particles in the actual crystal. 


HE most important permanent magnet alloys 
today (the Alnicos) are considered to be made 
up of fine particles' having dimensions of the order of 
10-* cm. The particles are too small to permit the 
existence of domain boundaries and therefore the 
processes of magnetization in these particles take place 
largely by rotation.’ This is a high-energy process and 
it results in obtaining a high coercive force which may 
be calculated as 


H.~E/I,, 


where /, is the saturation intensity of magnetization 
and £ the energy associated with the anisotropy. In 
general, we may be dealing with various types of anisot- 
ropy that result from: the shape of the particle, the 
crystal energy, and the stress. Recently Néel* has pro- 
posed a new anisotropy depending on the surfaces of 
the particles. 

The anisotropy of single cubic crystals of permanent 
magnet alloys has been measured by a number of 
investigators*~’ using the torsion method. Briefly this 
consists of measuring the torque necessary to rotate a 
single-crystal disk specimen in a strong uniform mag- 
netic field and then plotting the torque per unit volume 
as a function of the angular position of the crystal. 
From the measured values of torque, the magnetic 
anisotropy constant K is computed. For the (001) 
plane, the theoretical expression for the torque per unit 
volume is 

L=—(K/2) sin4#, 


where @ is the angle between the field and the [100] 
direction. For the (110) plane, 


= — (K/4) sin26— (3K/8) sin4@. 
The second anisotropy constant Ke is here neglected. 


1 L. Néel, Compt. rend. 225, 109 (1947). 

2 E. C. Stoner and E. P. Wohlfarth, Nature 160, 650 (1947). 

3C. Kittel, Phys. Rev. 70, 965-971 (1946). 

4L. Néel, Compt. rend. 237, 23 (1953). 

5 N. V. Bugakov, Doklady Akad. Nauk S.S.S.R. 69, 4 (1949). 

*K. Hoselitz and M. McCaig, Proc. Phys. Soc. (London) B64, 
549 (1951). 

7R. D. Heidenreich and E. A. Nesbitt, J. Appl. Phys. 23, 352 
(1952); E. A. Nesbitt and R. D. Heidenreich, J. Appl. Phys. 23, 
366 (1952). 


In most cases for homogeneous single crystals these 
formulas are satisfactory for computing the anisotropy 
constants of the crystal. Permanent magnet crystals, 
however, have a heterogeneous structure’ and therefore 
the situation is more complicated. This is illustrated by 
torque curves obtained on a single crystal* of Fe,NiAl 
in the (100) plane as shown in Fig. 1. The torque is 
positive at an H of 5000 oersteds when the field angle @ 
is 25°, and it decreases with increasing 17. At an H of 
12 000 the torque is slightly negative and as H is 
further increased to a value of 25 000 the torque becomes 
increasingly negative. Torque curves for the (110) 
plane which act in a similar manner are shown in Fig. 2. 
This is an unusual behavior since the torque for 
homogeneous crystals, in general, rises with increasing 
H until magnetic saturation is reached and then remains 
constant. 

These unusual torque curves, however, can be ex- 
plained on the basis of the physical structure of the 
crystal. Figure 3 shows an electron micrograph of the 
(110) surface of a single crystal of Fe:NiAl in which the 
material has a plate-like structure. It is this type of 
structure which is responsible for the unusual behavior 
of the torque curves shown in Figs. 1 and 2. It should 
be pointed out, however, that the structure shown in 
Fig. 3 was obtained by over-aging the crystal in order to 
make the plate-like structure visible in the electron 
microscope. The coercive force of this structure is 124 
oersteds. 


TORQUE CURVES ON WIRES OR RODS 


The effect of a plate-like microstructure such as 
shown in Fig. 3 on torque curves was determined by 
studying analytically and experimentally, models simu- 
lating this microstructure. The analysis was simplified 
considerably by using elongated cylinders or wires in- 
stead of plates. Three models were made by imbedding a 
single wire, two wires crossed at 90°, and three mutually 
perpendicular wires in blocks of Lucite so that torque 
curves could be obtained on each structure. The wires 
were made of annealed molybdenum permalloy and had 


* Crystals were prepared as described in reference 7. 
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SINGLE CRYSTALS OF Fe:;:NiAIl 


a length-to-diameter ratio of approximately 9. A torque 
curve on a single wire at H=4000 oersteds is shown by 
Curve 1 of Fig. 4. The curve is skewed and because of 
this skewness, if the curve is displaced 90° and replotted, 
the resultant of these two curves will not be zero. A 
curve of twice the frequency is then obtained for the 
resultant. The peak value of torque at this field strength 
is almost equal to that for the single wire but on a 
torque per unit volume basis it is reduced by a factor 
of two. Figure 5 shows the torque curve obtained on a 
single wire at H=7000 oersteds. Note that the curve 
is now skewed less at this higher field strength and that 
the resultant curve for 2 wires is consequently of lower 
amplitude. If H is increased further, the torque curve 
for a single wire tends to become symmetrical in the 90° 
interval and the resultant of the two curves, which are 
displaced 90° from each other, approaches zero. From 
this it is apparent that the torques on two single wires 
displaced 90° from each other tend to become equal in 
magnitude and since they are of opposite sign, the 
resultant torque approaches zero. 

Torque curves (Fig. 6) taken directly on crossed 
wires that are rotated about an axis perpendicular to 
the plane in which the wires lay agree, of course, with 
the resultant of the two single wires displaced 90° from 
each other. It should be noted that the easy direction of 
magnetization for these crossed wires occurs when the 
direction of the field bisects the angle between the 
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wires. This corresponds to a [110] direction in the 
(001) plane in the case of the crystal. The amplitude 
of the torque curves decreases and the skewed appear- 
ance of the curves tends to disappear with increasing 
field. It is shown analytically in Appendix I that the 
torque due to wires crossed at 90° approaches zero as 
the field becomes very large. 

For the length to diameter ratios of the plate-like 
structures that may be encountered in permanent 
magnet crystal structures, the H required to obtain 
zero torque is very high. This is indicated by the high 
fields (H=15 000) used to obtain the data in Fig. 6, 
and even these fields were not sufficient to saturate the 
wires. In addition, the crystal (Fig. 2) was not saturated 
for values as high as 31 000 oersteds. Further insight 
into the problem may be obtained analytically as shown 
in Appendix II. The curves for crossed wires obtained 
analytically are shown in Fig. 7 and they agree closely 
with the experimental curves. It is evident from the 
curves and from Eq. (6) in the Appendix that an 
infinite value of field would be required to reduce the 
torque of these crossed wires to zero. 

Torque curves obtained on the model having three 
mutually perpendicular wires are shown in Fig. 8. The 
model was oriented so that the axis of rotation was 
perpendicular to a plane corresponding to a (110) plane 
in the crystal. These torque curves have the same sym- 
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Fic. 1. Torque curves for Fe2NiAl single-crystal disk (100), cooled from 1300°C at approximately 8°C/sec. 
Coercive force of disk is 614 oersteds. 
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Fic. 2. Torque curves for Fe2NiAl single-crystal disk (110), cooled from 1300°C at approximately 8°C/sec. 
Coercive force of disk is 620 oersteds. 
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Fic. 3. Electron micrograph of single-crystal disk of Fe2NiAl 
(110), cooled from 1300°C at approximately 8°C/sec, then aged 
at 820°C for 7 hr. Coercive force of disk is 124 oersteds. 





metry as the curves for the (110) crystal disk and the 
amplitude of the curves decreases with increasing field. 


RESULTS ON SINGLE CRYSTALS 


On the basis of the results obtained on the crossed 
wires, the unusual behavior of the torque curves of the 
crystal shown in Figs. 1 and 2 is attributed to the shape 
anisotropy of the precipitate which is oriented in the 
three (100) directions. In the case of the crystal, how- 
ever, the situation is more complicated since the torque 
actually reduces to zero and then changes sign. This is 
because there is crystal anisotropy in addition to shape 
anisotropy and the former has a different easy direction 
of magnetization from the latter. At low fields, the 
effect of shape predominates and at high fields crystal 
anisotropy is more important. 

If we increase the shape effect of the precipitate by 
over-aging the crystal, the torque due to shape effect 
is predominant at all fields used. This is shown in Fig. 9 
by a series of curves of apparent anisotropy constant vs 
H, obtained after subjecting the crystal to different 
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treatments which resulted in various coercive forces H.. 
For an H, of 600 oersteds the specimen is not over-aged 
and the apparent anisotropy constant which is negative 
decreases in magnitude to zero and finally becomes 
positive as H is increased. This is from the same data 
as are shown in Fig. 2. When the crystal is over-aged 
somewhat, H, is 300 oersteds and the torque cannot be 
reduced to zero for values of field as high as 20 000 
oersteds. Upon further overaging H, is 47 oersteds, the 
shape effect is now very well developed and at an H 
of 20 000 the apparent anisotropy constant has the very 
high value of 600 000. 

Figure 10 is an electron micrograph of the crystal in 
the latter state (H.=47) and the well-developed shape 
anisotropy is evident. The coercive force is low in this 
structure because the particle size is large enough to 
permit domain wall motion. In contrast to this struc- 
ture, Fig. 11 shows an electron micrograph of a crystal 
for which H, is 525 oersteds. Here the elongation of the 
precipitate cannot be resolved in the electron micro- 
graph but the behavior of the torque curves indicates 
that the precipitate is elongated. 

The torque curves in Figs. 1 and 2 were obtained on 
single crystals of Fe2.NiAl in the (100) and (110) planes, 
respectively, when H,.=600 oersteds. The results show 
qualitatively the same effect with increasing fields. In 
both cases, the torque reduces with increasing field, 
passes through zero and then changes sign. The torque 
curve appears to pass through zero at somewhat higher 
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Fic. 4. Torque curve for a single Mo Permalloy wire at 
H = 4000 oersteds. 
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Fic. 5. Torque curve for a single Mo Permalloy wire at 
H=7000 oersteds. 
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Fic. 6. Torque curves for two Mo Permalloy wires at right 
angles to each other. 


values of H for the (100) crystal plane (H=12 000) 
than for the (110) plane crystal (H=7000). This is 
also shown by the experimental torque curves for 2 
wires (100) and 3 wires (110), the latter structure 
having a lower amplitude of torque at high fields than 
the former. 


In order to obtain the final value of the crystal 
anisotropy, it is necessary to extrapolate to infinite H, 
and when this is done values of K of 110 000 and 150 000 
ergs per cm*® are obtained for the (100) and (110) 
planes, respectively. 

The relative importance of the shape and crystal 
anisotropies in determining the coercive force was found 
by measuring the directions of easy magnetization, coer- 
cive force, and saturation magnetization at elevated tem- 
peratures. A crystal disk with a (110) face was placed 
in the torque magnetometer and heated to 600°C. 
Full torque curves were not measured at 600°C; at 
high field strengths (7=18 000) the crystal was iso- 
tropic. The [001] direction of easy magnetization had 
disappeared, indicating that the crystal anisotropy had 
been practically reduced to zero at this temperature. 
At low field strengths (H =4000) however, substantial 
torques were still necessary to turn the crystal away 
from (111) directions. Therefore the coercive force of 
the crystal at 600°C which was found to be 320 oersteds 
must be due almost entirely to shape anisotropy. 

To find the value of coercive force due to shape 
anisotropy at room temperature, a temperature run 
(Fig. 12) on a polycrystalline bar of Fe.NiAl was made 
to determine the change in saturation with temperature. 
The coercive force of the bar was 320 oersteds at 600°C 
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Fic. 7. Calculated torque curves for two wires at right 
angles to each other. 


and since the coercive force due to shape anisotropy is 
proportional to the saturation, the room temperature 
value of coercive force should be 465 oersteds. This 
leaves approximately 155 oersteds to be accounted for 
by crystal anisotropy and therefore shape anisotropy 
accounts for about 75 percent of the coercive force. 
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AND BOZORTH 


CONCLUSIONS 


As a result of this work, it appears that shape anisot- 
ropy of the fine particles is responsible for the unusual 
behavior of the torque curves obtained on single crystals 
of Fe.NiAl. We have not been able to detect elongated 
particles in electron micrographs taken when the crystal 
is in its high coercive force state. Neither is shape anisot- 
ropy apparent, at first, from torque curves obtained on 
these crystals since the curves show a fourfold sym- 
metry in the (100) plane which is indicative of cubic 
crystalline anisotropy. However, our analysis shows 
that the fourfold symmetry obtained at low values of H 
(about 5000) is due principally to shape anisotropy of 
plates (made up of rods’) oriented 90° to each other. 
The torque resulting from the shape of the plates 
arranged at 90° to each other eventually becomes zero 
at infinite H. Therefore, by extrapolation, we have been 
able to obtain the crystal anisotropy of the alloy. The 
relative importance of the crystal versus the shape 
anisotropy has been determined by making measure- 
ments at 600°C where the crystal anisotropy prac- 
tically disappears. As a result of these measurements, 
it is concluded that shape anisotropy is a major factor 
in determining the coercive force of single crystals of 
Fe,NiAl. 

The experimental work, also, supplies us with a simple 
explanation for the easy direction of magnetization in 
these crystals. Under permanent magnet conditions and 
at field strengths of less than 7000 oersteds the [111] 
direction is easy. This is due to the shape effect of three 
sets of plates mutually perpendicular to each other. At 
higher fields, the effect of shape on the anisotropy tends 
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Fic. 8. Torque curves for three mutually perpendicular wires measured in the plane corresponding to (110). 
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SINGLE CRYSTALS OF Fe:NiAl 


to diminish and the easy direction becomes the [001 ] 
direction as a result of crystal anisotropy. 

Since Fe,NiAl is closely related to Alnico 5 and since 
its electron micrographs are similar, the explanations 
for the easy directions of magnetization in crystals of 
Fe.NiAl also apply to Alnico 5, which is being inves- 
tigated in a similar manner. 

The anisotropy recently ascribed by Néel* to the 
surface effects of small particles can contribute to the 
torque in the same way as the shape anisotropy does. 
If surface anisotropy plays a significant role in Fe,NiAl, 
we can still conclude that fine precipitated particles 
are present as plates. When the particles are large the 
surface anisotropy becomes negligible but the shape 
effect persists; when they are small, surface effects are 
present along with shape anisotropy. 
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Fic. 10. Electron micrograph of single-crystal disk of Fe,NiAl 
(110), cooled from 1300°C at approximately 8°C/sec, then heated 
at 820°C for 35 hr. Coercive force of disk is 47 oersteds. 
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Fic. 11. Electron micrograph of single-crystal disk of Fe:NiAl 
(100), cooled from 1300°C at approximately 8°C/sec. Coercive 
force of disk is 525 oersteds. 
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perature on a polycrystalline bar of Fe:NiAl. 
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APPENDIX I 
Torque in High Fields 


According to Fig. 13, the torque Z per unit volume 
for one wire is 


L,=HI, sina. (1) 


This is the torque resulting from the action of the 
applied field H on the magnetization of the wire; op- 
posing it is an equal and opposite restoring torque 
which tends to align the domains along the length of 
the wire; this is equal in magnitude to 


L,=—[Ni, sin(@—a;)— Nil, sin(@@—a) J, cos(@—a) 
=—43(N,—N)I- sin2(@—a), (2) 


where NV, and N; are, respectively, the transverse and 
longitudinal demagnetizing factors of the wire. When 
the wire is completely saturated, le | and the torque 
reduces to 

L,=—}(N.—N DI sin26. 
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Fic. 13. Two wires crossed at right angles, simulating rods 
of precipitation. 


The torque for the other wire at 90° to the first is 
opposite in sign to Z; and is 
L2=}(Ni—N I sin2 (6+ a2). 
When a2=0, the resultant torque of the 2 wires is 
L=}(L:+L2)=0. 


This limit is approached in very high fields. 

When the wires are nearly saturated a, and ae are 
small and nearly equal. As a first approximation we put 
sina2= sina;= a, and find the resultant torque per unit 
volume to be 


L=4(L)4+-L2)=—}(N.—N DI P[sin2 (@—a) 


—sin2(0+a2) J=ai\(Ni—N)IZ cos20. (3) 
From Eqs. (1) and (2), at all values of a; 
HI, sinay=}3(N.—N I sin2(6—a,). (4) 
When a; is small, 
a,=4(N.—N)I,(sin20)/H (5) 


for the one wire, and a similar expression in a2 applies 
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Fic. 14. Curves Bwing how the torque varies with H for two 
complementary values of @. The resultant torque for two per- 
pendicular wires is } of that shown by the difference curve. 


WILLIAMS, 


. wires (Fig. 6). 


AND BOZORTH 


to the other wire. The torque per unit volume for the 


pair of wires is then, from (3) and (5) 
L=}(N,.—N)*/,3(sin46)/H, (6) 


which approaches zero as H becomes indefinitely large. 
This relation is in accord with the period and amplitude 
of the experimentally determined curves on crossed 


APPENDIX II 
Torque in Field of Any Strength 
From Egs. (1) and (2) of Appendix I, 
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Fic. 15. Curves showing torque vs field-strength for particles 
having various transverse and longitudinal demagnetizing factors 
N,and N,, and saturation /,. 


A value of @ is selected, and these two expressions are 
calculated for this 6 and various values of a; and a curve 
plotted as shown in Fig. 14. More specifically 
L,/(N.i—N DI. is plotted as a function of H/(N:i—N dl 
with @ constant. The resultant torque, L/(N:—N)I?, 
of two crossed wires is half the difference in height of 
the ordinates of the curves for 6 and 90°—@. From a 
series of such curves, curves of the type of Fig. 14 can 
be obtained directly. The required value of H is selected, 
I is determined by the material, and NV, and NV; by the 
physical dimensions of the wire. In the experiments 
performed J,= 690, V,= 6.226, and N ,;=0.113. 

Curves for various other values of 6 are given in 
Fig. 15. Similar calculations and experiments have been 
carried out for three mutually perpendicular wires. 
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An Electrodynamic Perturbation Theorem, with Application to Nonreciprocal Systems 


ALFRED G. REDFIELD* 
Gordon McKay Laboratory, Harvard University, Cambridge, Massachusetts 


(Received November 16, 1953) 


By means of a slight modification of the macroscopic electrodynamic reciprocity theorem it is possible 
to obtain the perturbation of the admittance matrix of an electrodynamic device due to a suitable per- 
turbation of the dielectric susceptibility, conductivity, and magnetic susceptibility tensors in the device. 
The theory is applicable to nonreciprocal systems, but in practice can be used to obtain only the first-order 
perturbation of the admittance matrix. An expression relating the current induced in part of a system due 
to the motion of current in another part of the system is similarly obtained. Dc and microwave gyrators 
are treated, and the connection is pointed out between Faraday rotation in a transmission cavity and the 
magnetic energy splitting of its circularly polarized normal modes. 





INTRODUCTION 


HE macroscopic electrodynamic reciprocity the- 
orem! was first proven by Carson* and by 
Pleijel® for systems in which radiation is important. 
As Carson pointed out,‘ his treatment is not valid for 
the case of transmission through an ionized gas in a dc 
magnetic field. Recently, several authors®:* have pointed 
out other mechanisms leading to nonreciprocity in 
electrodynamic systems, and several useful nonrecip- 
rocal devices have been build.?'§ The nonreciprocity of 
such a device is a consequence of the asymmetry of the 
dielectric constant, conductivity, and magnetic suscep- 
tibility tensors of matter in the device, brought about 
by application of a large magnetic field. The proof of the 
reciprocity theorem requires that these tensors be 
symmetric. 

In this paper a useful modification of the reciprocity 
theorem is derived for a generalized system. This theo- 
rem is used to evaluate the antisymmetric part of the 
admittance matrix of an electrodynamic system in 
terms of the electromagnetic fields produced by the 
applied voltages in the system, and to obtain the per- 
turbation of the admittance matrix produced by suitable 
perturbations of the system. These results are applicable 
as long as the macroscopic Maxwell equations are, and 
can be used to treat dc and microwave gyrators, as well 
as reciprocal systems. The main limitation to the 
method is that it can only be carried out to a first order 
of approximation. It seems likely that similar methods 
can be applied to mechanical, electromechanical, and 
hydrodynamic systems. 


* National Science Foundation Postdoctoral Fellow. 

'S. A. Schelkunoff, Electromagnetic Waves (D. Van Nostrand 
Company, Inc., New York, 1943), p. 476. 

* Montgomery, Dicke, and Purcell, Principles of Microwave 
— (McGraw-Hill Book Company, Inc., New York, 1948), 
p. 141. 

*J. R. Carson, Bell System Tech. J. 3, 393 (1924); 9, 325 (1930). 

‘J. R. Carson, Proc. Inst. Radio Engrs. 17, 952 (1929). 

°H. Suhl and L. R. Walker, Phys. Rev. 86, 122 (1952). 

®°E. M. McMillan, J. Acoust. Soc. Am. 18, 344 (1946). 

7C. L. Hogan, Bell System Tech. J. 31, 1 (1952). 

* Mason, Hewitt, and Wick, J. Appl. Phys. 24, 166 (1953). 


GENERAL THEOREMS 


Consider a system C’ which is bounded but not neces- 
sarily closed. We consider steady-state solutions of 
frequency w, and all field quantities can be complex 
according to the usual ac convention. The system is 


assumed to obey the macroscopic Maxwell equations 
(MKS units): 


VX H’—iwkE’—u’=0, (1’) 


(2’) 
where u is the total real current (conduction plus dielec- 
tric polarization but not displacement), M is the mag- 


netization, and the other symbols are conventional. The 
current and magnetization are given by 


VX E’+ iwuo(H’+M’)=0, 


u’ = (iwx-€o+c) (E’+F’) =\(E’+ F), 
M’=x»(H’+ 8’), 


(3’) 
(4’) 


where x-, ¢, and x» are the electric susceptibility, con- 
ductivity, and magnetic susceptibility tensors (which 
can be nonsymmetric), and F’ and 8’ are the electro- 
motive and magnetomotive forces or sources which give 
rise to the electric and magnetic fields in the system. 
(This definition of emf and mmf is rather arbitrary but 
is easily transformed to any other definition.) The 
tensor A is introduced for convenience. 

We also consider a second system C” in which the 
fields also obey Maxwell’s equations and are set up by 
a second distribution of forces F” and 8’, characterized 
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by the equations 
u’= (A+6A) (E”’+ F”), (3’’) 
M” = (xm+5xm)(H”+6”), (4””) 


where 5A and 5x», are defined analogously to \ and xm, 
and express the difference between systems C’ and C”. 
The field quantities in Eq. (3’’) and (4’’) are regarded 
as functions of the same space coordinate x as those in 
Eqs. (3’) and (4’). Then the following identity holds: 


fx H”—E”XH’)-ds 
+ ftw’ F’—y". F’ —iwuo(M’- g’—M”. 8’) Jdv 


= f [ teopto (H’+ 8’) (2xm°+5Xm) (H+ 8”) 


— (E’+F’) (242+) (E"+ F”) Jdv, (5) 
where xm* and A® are the antisymmetric parts of xm 
and X, and the surface integration is over the surface of 
the region of the volume integration, which is arbitrary. 
The derivation of this equation is simple and very 
similar to that of reciprocity theorem!’ and is therefore 
omitted. 

To apply Eq. (5) to get the asymmetry and pertur- 
bation of the admittance matrix o;; of an electro- 
dynamic device (Fig. 1a) the integration of Eq. (5) is 
taken over all space and the terminal loops A and B 
are assumed equivalent to plain loops in which |F| 
=V,/A and V,/A for an infinitesimal length A, and 
in which F points along the wire at the point where A 
is located (Fig. 1b). The surface integral vanishes at 
infinity? and one easily obtains 


0 jj —o;/ = J (ico Oxnt-+ x0) HL” 


—E,/(2d°+8A)E,"]dv, (6) 
where E,’, H,’ are the fields in the unperturbed device 
(characterized by \ and x») when V;=1, Vi=O (R¥ J), 
and E,’, H,’”’ are the fields in the perturbed device 
(characterized by 


A+6A, Xmt+5xm) when V;=1, Vi=O (R¥i). 


Equation (6) can easily be generalized to include 
magnetic and waveguide inputs. In the latter case the 
region of integration is taken to exclude the interior of 
the microwave generator and the result is identical to 
Eq. (6), under the waveguide voltage and current 
conventions of reference 2. If the unperturbed system 
is reciprocal, Eq. (6) can be used to obtain the change 
5o;; due to the perturbation. If the perturbation is zero, 
Eq. (6) gives the antisymmetric part of the admittance 
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matrix. The preceding results are obviously applicable 
to devices having more than two terminals. 

The current induced in an external current loop (or 
magnetic loop or wave guide) by a given distributed 
emf or current (or mmf or magnetization) in a reciprocal 
device can be similarly obtained: 


T= { (PNB, iow xuH)de 


or 


— f (a,-E,— scape, Hee, (7) 


where E; and H; are the fields in the system where 
V;=1, 8=0 and F=0, and u, and M, are the impressed 
current and magnetization (which are equivalent to the 
impressed emf F and mmf 6 and related to them by A 
and xm) which produce the current J; in loop i. Equa- 
tion (7) is a generalization of a theorem proven by 
Schockley.° 


APPLICATIONS 


All the equations derived so far have been exact but 
have required knowledge of the fields in both the per- 
turbed and unperturbed systems. In applying these 
formulas it is necessary to approximate these fields by 
the fields in the unperturbed system. There is no obvious 
way to introduce a higher order correction into the 
theory, and this is a serious limitation in microwave 
problems involving ferrites, where the field perturba- 
tions are likely to be large. Many problems to which 
the theory is applicable can also be treated in other 
ways. Nevertheless, this theory may be useful for some 
future application and can perhaps give more physical 
insight into the effects of various perturbations than 
can other theoretical treatments. 

At low frequencies xm* is finite or zero and \°E 
=puuEX Ho where uy is the Hall mobility and Hp is the 
dc magnetic field. Therefore the Hall voltage 6x, which 
would appear on terminals B if they were open-circuited, 
due to an applied voltage V., for an arbitrary arrange- 
ment of electrodes on a conductor, is given by 


f uno Bax Ho) - Eadv 


bu, 50a 


Va Tbb a 
f E,-cEdo 


This equation can be used to analyze the author's 
method” of measuring the electronic Hall mobility in 
insulating photoconductors. It apparently holds even 
in the presence of space charge effects, provided they 
are linear. The dc case has also been treated by Wick." 





(8) 


®W. Shockley, J. Appl. Phys. 9, 635 (1938). 
1 A. G. Redfield, Phys. Rev. 91, 373 (1953). 
1 Reference 8, footnote 4. 
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Fic. 2. Generalized microwave gyrator. The dotted line S denotes 
a surface which cuts completely across the gyrator. 


For microwave gyrator applications it is useful to 
consider the arrangement of Fig. 2. The gyrator section 
may be a waveguide or cavity containing a ferrite or 
an ionized gas in a de magnetic field, and the attenuator 
and coupling sections are assumed reciprocal. The 
gyrator and attenuator sections are assumed to be 
invariant under a 90° rotation about the horizontal 
axis so that there are two similar orthogonal modes of 
transmission (which we will call 1 and 2) in each direc- 
tion. Mode 1 is defined as the mode excited by loop A 
when the dc magnetic field and magnetization are zero, 
and the section coupling loop B can be rotated 90° 
about the horizontal axis with respect to the rest of the 
system so that it is coupled with either mode 1 or 
mode 2. J»; denotes the current in loop B when the 
coupling section is in position 1; J,2: denotes the current 
in loop B when the coupling section is in position 2. 
Ty2 is zero unless the dc magnetic field induces coupling 
between the two modes, and its magnitude is propor- 
tional to the resulting rotation of the microwaves in 
the gyrator. The coupling sections present admittances 
2oaa and 20%, to the loops A and B, so that the generators 
see admittances gag and cy». Without loss of generality 
we can assume that the coupling and attenuator sections 
have been designed so that oa, and oy» are real, and that 
Ty, has the same phase as V, when the magnetic field 
is off. Z, and Ly are assumed large so that oa and ow, 
do not depend significantly on the rotation of the 
coupling section B with respect to the system. 

We will regard the gyrator as unperturbed when the 
de"magnetic field is off (A*, xm*=0); application of the 
magnetic field introduces a nonreciprocal perturbation 
into the system. 

Using Eq. (6) we have 


Tyo= Vatba2 
=V, J (icoyioEe!Xm*Hs2!’—Ee'X*Ey2"”)dv, (9) 


where o%a2, Es2”’, and Hye” are ova, E,’’ and H,” defined 
as before and evaluated with the loop B in position 2. 
For small gyrations E,’, H,’, E,’”, and H,” can be 
approximated by their values in the unperturbed 
system. 

Now assume that the coupling section is in position 1. 
Generator V, supplies power $V.’¢aa to the system. 
Half of this power is dissipated in the external matching 
admittance 2¢aa. Only a fraction LaLgalgeLl, of the 
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remaining power is dissipated in the external admittance 
2onn ° 


1 — 2 
31 1°/2o00= 4 V Melelelala, 


(10) 


Ty2/To1 = Tba2/ (Gea? vo Lal ol gal gv) , (1 1) 


The power crossing the surface S is given by $V a0aaLalga- 
Equating this expression with that given by integrating 
the Poynting vector normal component over S gives 


so 


Caalalioa=2 Re f (Bax H,*) -ds. (12) 


Applying similar reasoning to eliminate oy,1,L 45, gives 
Toe 


—=0~ 


Te 
J (iouoH, P Xm?Hy2— E, 4 \°E,)dov 





(13) 
4 
| Re f (E.X H,*)-ds Re f (Ex2X His") -ds| 


In cases where © is pure real this expression gives 
the angle of rotation of the gyrator (provided the modes 
of transmission in the gyrator are only onefold sym- 
metric about the axis, i.e., like a TE,; mode); if 0 is 
complex the output of the gyrator is elliptically 
polarized. If the gyrator is cylindrically symmetric this 
formula can presumably be extended to large rotations 
by iteration, as long as the rotation per wavelength is 
much smaller than one radian. Equation (13) does not 
depend on the details of the coupling; all that is required 
is to know the fields in the gyrator for one mode of 
propagation. Frequently E,2 and Hy: can be obtained 
from E, and H, by symmetry. Treatment of higher 
modes in cylindrical gyrators can be accomplished by 
methods similar to the above. Waveguide gyrators 
have also been treated by Suhl and Walker.*® 

If the gyrator is a transmission cavity near resonance, 
it is simpler to use the relation O=o%42/oa1. From con- 
servation of energy and because ¢api™oa1, and oba1 
~ovrai*, it is easy to show that opaimRe Sf (E.*-AEs: 
— iwuoH,*-xmH»1)dv, where the integrand (near reso- 
nance) is negligible except inside the cavity. Assuming 
the cavity is axially symmetric and is symmetric about 
a plane perpendicular to its axis, it follows that 


(9) = (Q/we) f (touoHixm*H.— E,\°E;)d2, (14) 


where Q is the Q of the cavity, ¢ is the energy stored in 
the cavity for fields E,, H; in the cavity, E; and H; 
are the unperturbed fields in the cavity at resonance, 
and E, and H: are obtained from E, and H;, by a 90° 
rotation. To get Eq. (14), we have used the fact that 
Trai is, by the symmetry of the fields, approximately 
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equal to the average power dissipated in the cavity by 
one mode alone. Equation (14) is valid as long as the 
magnetic splitting of the resonance frequency in the 
cavity is small compared to the resonance width. This 
is equivalent to the assumption that 0<1, as will 
be shown below. The effect of generalized media in 
cavities has also been treated by Berk and Lax.” 
Faraday rotation in a transmission cavity can be 
related to the splitting of the degeneracy and the change 
in Q of the cavity in a simple and instructive way. We 
assume that at time ‘=0, we somehow excited a linearly 
polarized mode of oscillation in a cavity; this linearly 
polarized mode can be regarded as the sum of two 
oppositely rotating circularly polarized modes." If the 
degeneracy of the two modes is split by a magnetic 
field, the resulting oscillation will look like a plane 
oscillation whose plane of polarization is slowly rotating 
with an angular frequency equal to one half the angular 
frequency difference between the two circular modes 
Aw. Such an oscillation will be damped out in a mean 
time r= 20/w, so that the angle through which such a 
vibration will turn before being damped out is QAw/w. 
Since the input waveguide is continually exciting these 
two circular modes in equal amplitudes (assuming 
Aw<w/Q) and the output waveguide is sensitive to the 
mean polarization of the resulting precessing and 
decaying oscillation in the cavity, the output waveguide 


2 A. D. Berk and B. Lax, Quart. Prog. Rep., Mass. Inst. Tech. 
Res. Lab. of Electronics, 1952 and 1953. 
( 37. O. Artman and P. E. Tannenwald, Phys. Rev. 91, 1014 
1953). 
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polarization will be oriented at an angle O=QAw/w 
relative to the input. 

To make this derivation slightly more general we 
assume that the two circular modes have a time de- 
pendence 


At+=exp(iws—wit/Q1) and A~=exp(iwet—woet/Qe2). 


If these modes are excited equally by an incoming 
linear mode of the waveguide at frequency w= 4 (w1+w) 
polarized in direction 1, the oscillation in the 1 direction 
in the cavity will have a mean amplitude and phase 
relative to the incoming wave given by 


cf (At++A-) exp(—iet)dt, 
0 


where C is a coupling constant. The mean amplitude 
in the 2 direction will be 


cf i(A+— A) exp(—iwt)dt. 
0 


The complex angle of rotation © is equal to the first 
integral divided by the second: 


O~[20 (wi—w2)/ (wit we) | 
—iL(Qi—Q2)/(Qi1+Q2)], (15) 


provided 01. This equation indicates that a Faraday 


rotation in a cavity should be detectable even when a 
resonance splitting is not. 
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Short-Time Frequency Measurement of Narrow-Band Random Signals in the Presence 
of Wide-Band Noise* 


Peter M. ScHULTHEISS, CONRAD A. WoGRIN, AND FELIX ZWEIG 
Department of Electrical Engineering, Yale University, New Haven, Connecticut 
(Received October 1, 1953) 


In certain applications it is necessary to measure the center frequency of a symmetrical power spectrum 
in the shortest possible time. Two instrumentations for short-time frequency measurement, the autocorre- 
lator and the frequency discriminator, are examined for their ability to accomplish this task. The analysis 
compares the performance of the two systems and indicates the effect of such factors as signal-to-noise ratio 


and filter adjustment. 





INTRODUCTION 


N the measurement of physical quantities by elec- 
trical means, it is not uncommon to translate the 
actual information into frequency shift of a carrier wave. 
The process is essentially one of frequency modulation; 
the information can be recovered at any time by virtue 
of the known one to one relation between the signal’s 
“instantaneous frequency” and the desired physical 
data. While the term “instantaneous frequency” is self- 
contradictory in a sense, this point offers no serious 
obstacle to formal mathematical analysis. The problem 
of frequency modulation in fact has been treated exten- 
sively and rigorously in the literature.' 

The simple process described thus far is often com- 
plicated by the very manner in which the data are 
taken. If the original information is, for instance, sup- 
plied from a source subject to random disturbances, one 
cannot expect the continuous exact correspondence be- 
tween desired information and frequency postulated 
above. If the measured quantity were to remain fixed, 
one would not receive a single frequency in that case 
but rather a power spectrum presumably symmetrical 
about some mean frequency which could be calibrated 
to furnish the required data. The width of this power 
spectrum would vary inversely with the quality of the 
basic measuring process. A perfect process would result 
in a spectrum of zero width, a single frequency, but 
systems of considerable practical importance yield spec- 
tra with a band width equal to 10 percent or 20 percent 
of their mean frequency. 

The power spectrum differs in one important respect 
from single frequency data. The frequency of a sine 
wave can be determined precisely in a relatively short 
time interval, for instance, by measuring the time be- 
tween two successive zeros and assuming periodicity. 
To establish the mean frequency of a power spectrum, 
on the other hand, an infinite length of time is required. 
Observation over any finite period can only determine 
the true (long-time) mean frequency to within certain 

* The contents of this paper are taken in essence from a dis- 
sertation by Peter M. Schultheiss presented to the faculty of the 
Graduate School of Yale University in partial fulfilment of the 
requirements for the Degree of Doctor of Philosophy. 

1See, for instance, S. Goldman, Frequency Analysis, Modula- 


- and Noise (McGraw-Hill Book Company, Inc., New York 
948). 


statistical bounds. Since the physical quantity under 
observation will generally vary as a function of time, 
very definite limits are imposed on the permissible 
observation period and the accuracy attainable in a 
given length of time thus becomes exceedingly impor- 
tant. It is the purpose of this discussion to compare 
various basic techniques of frequency measurement 
with a view towards maximizing the accuracy attainable 
in a predetermined time interval. 

The frequency sensitive element may take a variety 
of forms. It may consist of a conventional frequency 
discriminator, essentially a device which measures two 
points on the power spectrum and deduces the mean 
frequency from symmetry considerations. Alternatively 
one may use an autocorrelator whose operation might 
be described loosely as the time domain analog of the 
power spectrum measurement performed by the dis- 
criminator. Still another method employs a unit record- 
ing the number of zeros or maxima occurring in a given 
length of time. The present paper is concerned only 
with the first two instrumentations because they lend 
themselves to closely similar analytical treatments 
while the third is approached more conveniently from 
a somewhat different point of view. 


A FIGURE OF MERIT FOR SHORT TIME 
FREQUENCY MEASUREMENT 


The operation of a frequency sensitive device must 
clearly depend on the statistical properties of the volt- 
age applied to its input terminals. For the purposes of 
this paper it will be assumed that the input possesses 
a normal amplitude distribution, so that it might be 
thought of as thermal noise after passage through a 
narrow-band filter. This is the simplest possible assump- 
tion, and one which probably comes close to the truth 
for an important class of systems. The actual shape of 
the power spectrum is not very critical as long as the 
center frequency alone is of interest. It is only necessary 
that it be narrow compared to the center frequency and 
symmetrical about it. A Gaussian characteristic (Fig. 1) 
meets both of these requirements and has the additional 
advantage of leading to relatively simple mathematical 
manipulations. 

Before various techniques for short-time frequency 
measurement can be compared, it becomes necessary to 
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Fic. 1. The input power spectrum. 


establish a figure of merit for their performance. If the 
physical quantity being monitored, represented by the 
center frequency fo of the power spectrum, is expected 
to fluctuate over a wide range, it may well be imprac- 
ticable to devise a frequency sensitive element with 
sufficient linear range, and one will have to resort to a 
feedback arrangement such as shown in Fig. 2. This 
procedure becomes almost imperative if the narrow- 
band signal is superimposed on a background of wide- 
band noise, for any noise-limiting filter must clearly be 
centered about fo, which implies either an adjustable 
filter or an essentially fixed fo. Either approach requires 
a feedback scheme but in practice it would probably 
be far more cumbersome to arrange for a frequency 
variant filter than to build the relatively simple circuit 
of Fig. 2 which keeps the mean frequency at the input 
to the frequency sensitive element essentially fixed 
during normal operation. 

In Fig. 2 the frequency sensitive element operates 
effectively as a null seeking device: If the dc loop gain? 
is large because of high amplifier gain at dc, the steady- 
state value of eo must at all times be very close to zero 
while the steady-state value of the output is simply 
related to the center frequency fo. It appears reasonable 
to define as the “best” system for short time frequency 
measurement that which provides the maximum ratio 
of useful output (indication of fo) to unwanted disturb- 
ance (from noise and the random nature of the signal). 
In line with this requirement it is apparent that the 
frequency sensitive device should furnish as large as 
possible a change in steady-state output per unit change 
in fo while minimizing the rms value of the output 
fluctuation. Designating as y1to the variance of ¢9 when 
@=0, and as S the sensitivity to frequency changes for 
the same operating point (i.e., S=0é/dfo|z=0) one 
arrives at the figure of merit (u110/S)! which diminishes 
as the quality of the instrumentation improves. For 
mathematical reasons it will actually be more con- 
venient to work with the square of this ratio, the quan- 

2 “Tc gain” should be interpreted as the loop gain measured by 


breaking the loop at a point such as ¢éo, inserting a dc voltage and 
pocenonne be the return around the loop to the point of the break. 


WOGRIN, 


AND ZWEIG 


tity gito/S*, which will be used throughout the re- 
mainder of the paper.* 

In summary, the present investigation is concerned 
with the relative merits of two devices [ (a) the auto- 
correlator, (b) the frequency discriminator ] for short- 
time measurement of the mean frequency of a power 
spectrum. Both will be treated as isolated units; the 
feedback loop of Fig. 2 was introduced only to show 
that the condition é=0 and small excursions of fy can 
be considered as typical. Under these conditions the 
ratio yito/S* furnishes an intuitively appealing and 
mathematically justifiable figure of merit for compari- 
son of the two instrumentations. Extension of the 
argument to values of é other than zero offers little 
theoretical difficulty but further complicates already 
cumbersome algebraic manipulations. 

The next step must now be the derivation of general 
expressions for the output variance and sensitivity of 
the two frequency sensitive devices. 


THE SHORT-TIME AUTOCORRELATOR 


Autocorrelation and spectral density in the normal 
sense (averaged over an infinite interval) are entirely 
equivalent concepts: One can be obtained from the 
other by the well-known Wiener-Khintchine relation. 
The short-term equivalents of the two quantities on the 
other hand are not necessarily identical in information 
content.‘ It appears pertinent, therefore, to investigate 
both the short-time autocorrelator and the standard 
frequency discriminator and to compare their perform- 
ance under the specified conditions. 

The term short-time autocorrelation [y,(r)] will be 
used to designate the output of the device shown in 
Fig. 3. Its average output is, of course, simply the true 
autocorrelation ¥(r) of the input signal so that the 
sensitivity computation offers no difficulty. The output 
variance is most readily obtained by integrating the 
cortinuous portion of the output power spectrum W(/f). 


ane f Wo fds. (1) 


0 


The problem therefore reduces to the determination 
of Wo(f), a task readily accomplished with a method 
developed by S. O. Rice.* 


. frequenc 
u mixer | senuiti ve 
element 


local 
osc. 


output 

















Amplitier 


Fic. 2. Closed loop instrumentation for frequency measurement. 


3 The figure of merit may appear to have been chosen on rather 
intuitive grounds. More formal arguments justifying its selection 
are contained in the Appendix. 

*U. Fano, J. Acoust. Soc. Am. 22, 546 (1950). 

5S. O. Rice, Bell System Tech. J. 23, 282 (1944); 24, 46 (1945). 
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SHORT-TIME FREQUENCY 


If the input signal V(¢) is normally distributed, it 
may be represented by means of a Fourier series 


V(iO)= > Cx COS(Wml—Pm), (2) 


m=l1 


where C,,?=2w(fm)Af, @m=2tfm, fm=mAf, and w(f) is 
the power spectrum of V(t). Af is a small increment of 
frequency and ¢; --- ¢,, are independent random phase 
angles. 

Then 


V()V(I—-7)= > CnC, COS (Wnt — $m) 


m=l1 n=1 


X cos (wal—wnT—on) 


lwo w 
_ : > > CuCn{cosl (wm—wn)e 


Zm=l1 n=l 


—Pmtontownt | 
+cosl(wmtwn)t—dm—Gn—WnT |}. (3) 


The first term of Eq. (3) contains modulation prod- 
ucts of the difference type, the second term modulation 
products of the sum type. With a narrow-band input 
centered at a relatively large frequency (fo) only the 
former can contribute significant power near zero fre- 
quency. The sum terms will contribute a spectrum 
centered about 2/9 which will be effectively eliminated 
by the low-pass filter. However, in the interest of 
greater generality both terms will be retained as long 
as the ensuing mathematical complexity does not be- 
come prohibitive. 

The difference terms contribute power in the fre- 
quency range fx, fx +Af when m=k+1, n=/ or n=k+1, 
m=l. For a particular value of / the two terms in 
question are 


$CuriCifcosl (wiy1—wr)t— deri t+or+air | 
+cosl (wi41—wr)t—depit+$1—wr4i7 }} 
= $CiiCif (coswyr+ coswp 417) COS (wil — e411) 
— (sinw,7— sinw,417)sin (wit—Gi41+1)}. (4) 


The average power contributed by this particular set 
of terms is given by the expression 


8(Cu4iCr)*{ (cosayr+ cosw,.417)?+ (sinwyr— sinw,47)} 
=41(Cy4:C))*{1+cos(witwe,s)7}. (5) 


The corresponding terms for other values of / are ran- 
dom in phase relative to the above and to each other. 
Hence the total average power in the range fi, f.+Af 
due to the difference terms is obtained by summing 
Eq. (5) over all J. 


1 x 
Av. —— > Cig PC? (1+ coswer,47). (6) 


l=1 
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Fic. 3. The short-time autocorrelator. 





The coefficients C;,,/2 and C? may now be replaced by 
their equivalents 2w(f;.4:)Af and 2w(f1)Af, respectively. 
If the frequency increment Af becomes exceedingly 
small, the summation in / approaches an integral over 
all positive f and may be written in the following form. 


Av. poweraf f w(fet-fw() 
° X[1+-cos2e(2f+fx)r df. (7) 


Analysis of the sum type modulation products pro- 
ceeds in analogous manner. Here one begins with the 
pair of terms m=k—l, n=l, and m=1, n=k—l. The 
summation in / now extends only over the range (1,k) 
and in doing so covers every term twice. Otherwise the 
computation is straightforward so that the power spec- 
trum W(f) of V(t)V (t—7) can be written down without 
further difficulty. 


w= w(x)w(f+-x)[1+cos2x(2x+ f)7 ]dx 


1 
+5 [ wayu(f—a)C1-+cos2e(2e— fre. (8) 


If one extends the definition of w(x) to negative fre- 
quencies in such a manner that w(—x)=w(x), one can 
easily establish that Eq. (8) may be converted to the 
more compact form 


1 eo 
win f dxw(x)w(f—x)[1+cos2x(2x—f)r]. (9) 


The power spectrum Wo(/) after the low-pass filter 
is readily obtained from Eq. (9) since the filter is pre- 
sumably a linear device with transfer function Fo(27/jf). 


Wo(f)= | Fo(2xjf) |?W(f). (10) 


As soon as w(f) and Fo(2zjf) are given, the analysis 
will be reduced to a purely computational problem. 


THE FREQUENCY DISCRIMINATOR 


A simple form of the standard frequency discrimi- 
nator is shown in Fig. 4. F; and F2 are two band pass 
filters with center frequencies f,; and f2 arranged sym- 
metrically about the nominal mean frequency of the 
device. Square law detectors have been substituted for 
the more conventional linear rectifiers as a matter of 
mathematical convenience. This choice should not affect 
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the basic results to any appreciable degree. One may 
think of the system as operating by comparing the 
mean square voltage in the pass bands of the two filters. 
If F, and F, are made identical except for their center 
frequencies, the discriminator will indicate a mean fre- 
quency coinciding roughly with the center of mass of 
the incoming power spectrum. If the spectrum is sym- 
metrical and unimodal, that quantity will, of course, be 
identical with the frequency of the peak. 

The analysis now proceeds much as in the case of the 
short-time autocorrelator. Let the input voltage e, be 
represented again by the Fourier series of Eq. (2), and 
let the filter characteristics take the form 


e2() _ s(us)eitito (11)| 








Cree | Fy, Fs, v1, 2 real 
es(w) | functions of w. 
——=F,(w)ei) (12) | 

€:(w) 


It is an easy matter to write és in terms of these 
quantities in form of sum and difference frequencies as 
in Eq. (3). The power spectrum We(/) of e¢ is then 
again derived by separate examination of the sum and 
difference components. The details of this development 
are omitted here because it is algebraically tedious 
without offering any theoretical difficulties worthy of 
mention. 


W(f)= 2f {F2(f+2)F2(«)+F2(f+«)F (x) 


— 2F (f+) Fi(x)F2(f+)F2(x) 
Xcoslyi(f+x) —¥(«) —v2(f+2) 
+2(x) ]}w(f+x)w(x)dx 


I 
+f weU-ore@)+Feg-2F2@) 


— 2F (x) Fi(f—x)F2(«)F2(f—<x) 
Xcoslpi(f—x)+¥1(x) —v2(f—~x) 
—y2(x) }}w(f—x)w(x)dx. (13) 


In complete analogy with Eq. (8), Eq. (13) can be 
converted to a more compact form if one assumes 
w(—x)=w(zx), F\(—«)=F;,(x), F,(—x)=F;(x), ¥i(—x) 
= —y(x), vo(—x)=—ye(x). The last four conditions 























Fic. 4. The frequency discriminator. 
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are simply those of normal realizability in physical 
filters. 


Wo(f)= f {F?(f—x)F P(x) +F 2 (f—*)F 2 (x) 


—2F\(f—x)F (x) F2(f—x)F2(x) 
X coslyi( f—x)+¥1(x) —v2(f—») 
—wo(x) }}w( f—x)w(x)dx. (14) 


The output power spectrum W;(/), finally, is obtained 
from the relation 


W2(f)= | Fo(2mjf)|*Wel(f), (15) 
where Fo(2mjf) is the low-pass filter transfer function. 


AUTOCORRELATOR AND DISCRIMINATOR 
PERFORMANCE IN THE ABSENCE OF 
WIDE BAND NOISE 


Equations (9) and (13) will serve as the starting point 
for a discussion of system performance in response to 
an input consisting only of the narrow signal spectrum 
described in the introduction. 


Y fanf 2 
(2m)ta | 7 20° 


+ex0| - “Fl - (16) 


o 








w(x)= 





Consider first the short-time autocorrelator. Substi- 
tuting Eq. (16) into Eq. (9) and integrating, one obtains 
after some algebraic manipulation 


9 





W(f)= |2[1-+ exp(—da"rot cos4rfor | 


4(1)'o 


2 
xexp(—=,) +L ten(—4etee9] 
3 


[oS ) 


soo( AY]. on 


The function W(f) is sketched in Fig. 5. Any subse- 
quent low-pass filter of narrow band width will effec- 
tively eliminate the double frequency portions of the 
spectrum, so that only the first term in Eq. (17) need 
be considered. That portion of W(f) could also have 
been obtained by considering only the first term of 
Eq. (8) and using the input spectrum 


Wo (x—fo)? 
7 18 
(2m)to ex| 20? (8) 








w(x)= 
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Because of the resultant reduction in complexity, 
that procedure will be employed in future calculations.® 

The significant term of Eq. (17) has a form very 
important to the success of an autocorrelator instru- 
mentation. 


vo 


2(x)4o 


W(N= 





[1+ exp(—42*7’o?) cos4afor | 


xew(-2): (19) 


Considered as a function of r the cosine term assumes 
the value 


7 
—exp(—41°*7’o") = exp] —~ —(2n+1) 
4 fe 


1 
at saat *e=0,1,2,---. 


0 


With c</fo, this value is very nearly equal to —1 for 
the lowest . It follows that W(f/) and hence the rms 
output fluctuation exhibits a series of minima near the 
points r= (1/4fo)(2n+1). The first of these (n=0) is 
the sharpest and hence represents an optimum oper- 
ating condition from this point of view. 
The dc autocorrelator output is given by the ex- 
pression 
(7) =o exp(— 2270") cos2mfor. (20) 


The sensitivity S is therefore given by 


™ oy (7, fo) 


fo 


S 





= — 2rry exp(—27’7’o") sin2mfor. (21) 


This quantity reaches a maximum near r= (1/4fo) 
X (2n+1), n=0, 1,2 ---. The maximum for r1/4fo 
is the highest because of the exponential function in 
Eq. (21). Thus the rms fluctuation reaches a minimum 
essentially at the same point as the sensitivity attains 
its maximum value, so that the figure of merit j110/S? 
is clearly optimized at this point. It follows that the 
autocorrelator should be used as a null device operating 
about the first null of the true autocorrelation function 
¥(r). It is therefore well suited for operation in the 
circuit of Fig. 2. The time delay 7 should be so adjusted 
that the relation r=1/4fo is readily satisfied when all 
loop components are operating within their linear ranges. 

The autocorrelator figure of merit is now easily com- 
puted from Eqs. (10), (18), and (21). For simplicity in 
calculation a Gaussian low-pass filter will be used, 
although equivalent computations for physically realiz- 


* A slight discrepancy between the two methods arises from 
the fact that Eq. (18) does not vanish identically for x<0. How- 
ever, with e<fo, one may always replace an integral of the form 


| exp| - oe ax by fet exp| - eer lax 


without introducing any significant error. 








fe+2f, Ff 


Fic. 5. Autocorrelator output spectrum. 





able filter characteristics could be carried out without 
undue difficulty. 


| Fo(2xjf) |?= en{ -— ooo. (22) 


Then, 


_— f | Fo(2nif)|2W (fap 


" Yoo 
r [1+-exp(— 42°70") cos4afor]. (23) 


o 


oz 





Hence, the figure of merit at the operating null is given 


by the expression 
rT o 
v2{éed 1—exp( —~ -)] 
4 fe 
- (24 


119 


Ss? 


r=1/4fo 





) 





Equation (24) has dimensions of [frequency }. This 
is reasonable because it represents the mean square 
fluctuation of the instantaneous measurement of fo 
about its average value. An absolute figure of merit, 
the fractional error in short-time measurement, is ob- 
tained by normalizing with respect to f;?. Dividing 
Eq. (24) by this factor and using the numerical ratio 
a/ fo=0.15, one obtains the result 


F119 oo 
=0.00825—- (25) 


oe oC 





This figure must now be compared with the corre- 
sponding quantity for the discriminator instrumentation. 

Before the discriminator output spectrum can be 
evaluated from Eq. (13) or (14), the two filter charac- 
teristics must be specified. Since the exact nature of the 
amplitude functions F\(x) and F2(x) should not influ- 
ence such design considerations as filter band width and 
separation very critically, a Gaussian amplitude char- 
acteristic will be assumed. 














r  (x—f)?) 
F? = = 
@) as | 2ef J 
(26) 
Fi(2)mexp| —2—™)]. 
; ™| 2ef7 J 
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The question of a reasonable phase function remains 
to be decided. It is not possible to compute a con- 
sistent phase function from Bode’s integral because the 
Gaussian is not physically realizable and the resulting 
integral diverges. The choice must therefore be some- 
what arbitrary. A brief inspection of Eq. (13) reveals 
that this difficulty is not as serious as it might appear, 
for the phase function enters into the significant first 
term only in form of the difference ¥1(/+*)—y:(x) 
—[Wo( f+x)—wW2(x) ]. With a low-pass filter of less than 
1 cps band width, all f of interest are exceedingly small. 
The discriminator filters, with band widths of the order 
of 100 cps or more, cannot exhibit appreciable phase 
shift over so short an interval, and if the two filters are 
similar, even these small phase shifts tend to cancel. 
An approximation for ¥; and y2 using only the first 
terms of a power series expansion should therefore be 
more than ample. 


¥i(x)=—a(x—fi), Yo(x)=—a(x—fz), 


Then the argument of the cosine term in Eq. (13) is 
zero and the equation is simplified considerably. Equa- 
tions (26) and (27) may now be substituted into the 
first term of Eq. (13). In the low-frequency range the 
spectrum then assumes the form 


a>0. (27) 














vorp (h-f ] | a 
W.(f)= - + _ 
cL a/ 1a? [exp +a? ad +o? 
(fi— fe)? (fm— fo)? 
-_ exp — 4o/ exp — +o? | 
| 
x {exe| ---]}. (28) 
where 
aay _ Sith 
p= . 





roar 2 


The power spectrum after the low-pass filter is ob- 
tained from Eq. (15), Eq. (22) being used to represent 
the low-pass filter characteristic. The output variance 
follows immediately from integration of W7(f). 


Vora op 





Variance= f W; Vdf=—, 











oi . _Go-f ", - © (fo- ~"] 
7 +a? 4 +a’. 
(fi— fe)? (fm— fo)* 
—2 exp| ah ls «|-—— |}. (29) 
oT OF 


where use has been made of the fact that ox<p. The 
dc output é@; of the discriminator is given by the ex- 
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pression 


a f LFe(f)—F#(f)w( Nef 


=“ lexo| - Uf ae exp| - 
o 2(c?+<¢/’) 


The sensitivity is obtained by differentiating Eq. (30) 
with respect to fo and the figure of merit for the oper- 
ating null can now be calculated without difficulty. 
After some simplification it takes the form 


(fo— fz)? = |l. 
30 
2(0?+0/) ” 








M119 V200(0? +07)" 
S? | fo=sSm oos(fi—fe)? 
(fi— fe)? 
xX | 1-—exp}] -——————_ 
of (31) 
4—(o?-+0/’) 
oe? 


An attempt to minimize Eq. (31) with respect to the 
filter separation (/;—f2) leads to the result (fi—f2) > @. 
In physical terms this implies that the sensitivity falls 
off less rapidly than the output fluctuation so that the 
theoretical optimum performance would be obtained 
with the widest possible filter separation. Since some 
noise is always present in fact, such an adjustment 
would be entirely unrealistic. No complete evaluation 
of the frequency discriminator can therefore be made 
before the general case, including noise as well as signal, 
has been considered. This is done in the following sec- 
tion. For the time being, in the interest of a preliminary 
comparison with the autocorrelator, an arbitrary figure, 
justified only by the argument of the subsequent sec- 
tion, will be assumed. Using (/1—f2)?=607, c=0.15 fo, 
and normalizing Eq. (31) with respect to fo? one obtains 


the figure of merit 
of 5/2 
(+3) 
oo o 








Milo 
=0.0053— ———_———_ 
f°S?| fo=Im o oF 
o 
1.5 
X |1—exp| — 


off of\ te (32) 
02) 

a a 

A plot of Eq. (32) is given in the lowest curve of Fig. 


10. The minimum occurs at ¢/o;=0 and has the value 


F119 on 
=0.0079—- (33) 
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SHORT-TIME FREQUENCY MEASUREMENT 


This compares quite closely with the corresponding 
figure [Eq. (25) ] for the autocorrelator. It should be 
pointed out, however, that for all practical purposes in 
the autocorrelator the condition of best figure of merit 
coincides with that of maximum sensitivity, while in 
the discriminator the sensitivity approaches zero as the 
figure of merit reaches its minimum. This argument, 
apparently in favor of the autocorrelator, is of course 
strictly preliminary and subject to change when inputs 
with finite signal-to-noise ratio are considered. 


AUTOCORRELATOR AND DISCRIMINATOR 
PERFORMANCE IN THE PRESENCE OF 
WIDE-BAND NOISE 

The present section discusses the most general case, 
where the system input consists of wide-band noise 
superimposed on the narrow-band signal considered 
thus far. Unfortunately the complexity of mathematical 
manipulation is such as to make it impractical to incor- 
porate detailed derivations in this paper. After a state- 
ment of the initial equations the result will therefore 
be given immediately in the most general form avail- 
able. Thereafter a particular ratio of signal band-width 
to center frequency as well as a narrow low-pass filter 
will be assumed in order to obtain a meaningful com- 
parison under typical operating conditions. Since the 
difficulties alluded to are entirely due to algebraic com- 
plexity, the reader should have no trouble in modifying 
the general results to fit other particular situations. 

If signal and noise are independent, the input power 
spectrum can be represented by the expression 


(f—fo)? 
= to a er 








w(f)= |. (34) 


Note that y, is the noise power per unit frequency 
whereas Wo is the total signal power. The total noise 
power represented by Eq. (34) is infinite, hence the 
autocorrelator must be equipped with a frequency selec- 
tive filter if the output fluctuation is to remain within 
tolerable limits. The circuit of Fig. 3 is therefore modi- 
fied and now assumes the form of Fig. 6. The band- 
pass filter is tuned to the frequency at which the delay 
line introduces 90° phase shift and is assumed to have 
the Gaussian transfer function 





<= (35) 


2o7 


lA) I=ex0| - 


It is immediately evident that the voltage V; in Fig. 6 
is normally distributed in amplitude and has the power 
spectrum 





Sf * (36) 


2e f 


wi(f)=w(f) exp| — 


If Eq. (36) is interpreted as the input power spec- 
trum, the first term of Eq. (8) may be used without 
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Fic. 6. Short-time auto-correlator with noise limiting filter. 


modification to represent the low-frequency portion 
W(f) of the spectrum ahead of the low-pass filter. 
Formal substitution of Eqs. (36) and (34) into the first 
term of Eq. (8) leads to the result 


vo Of (fo—fi)? 
wif)= |---| 
2(x)} o(c?+o/7)! es toa? 


X {1+ exp[—42°7?(o*k a) ] cos4a(fo*fi)T} 


( a) 
2 











(fi—fo)* 
o is 2(c?+0/*) 
” cxf _ rtf a) 
2(o°ko/P+o/) 


of 
| 1+exn( — 82770? —) 


exkae—aof 


xc0s( 2 7 
f o 





of 


+4ar o* 1 , 1 
atte senen) | 





- (fi—f—fo* fi)? | 
2(e?ko/P+o,/’) 


of 
x | 1+exp(—8r'rot~) 








2 gtkg? 

Xc0s( ef —+4(itn'h) || 

Pkaf+o? 
+ (1) opp.2{1+exp[—42?7o,/7] 
f? 

Xcos4rfir} exe| ——} (37) 
where 

fof eo akete 

a+ oy a+ oy 


The computation of output variance now requires 
only multiplication of Eq. (37) by the square of the 
low-pass filter characteristic (Eq. (22)) and integration 
over all positive f. This process becomes elementary 
once one recognizes that the two terms in yop, are 
symmetrical with respect to f=0 and can be treated 
as a unit. Under assumption of a narrow low-pass filter 
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(ove) the result is 
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The sensitivity remains to be computed. Since wide- 
band noise does not contribute any dc output, the 





























: . 7s70 (fo—fi)? sensitivity can be calculated from the signal autocorre- 
variance=e exp] ~ lation alone 
2v20(0?+<0,/*)! +o? ; 
2)4 
oo, dy (7, fo) (of *o*) ° P 
= 2 * =o exp[_— 2x*r oxo; 5] 
x L+exn| 4n*r , | cos4arfo* fi afo i 
2 as 2 
LvolDes) (21) tesa (fi—fo)? 7 x | aes. exp| | 
” ) supamees  \redieauecmemmeres 2 2 2 2 
: a(20?+0/*)3 " 2(0?+0/) J ' bins saali 
—f 
te (fi—fo* fi)? Xsin2z (fo* Arte . 
x = . 
, 2(e?ko/?+o/) of 
oto - a re(fotf:) | (39) 
exp] —————— | cos2r The 
x | L+exp| - tt ™ 2(0?+-<¢/*) _ 
o* 
After Eqs. (38) and (39) have been evaluated at the 
7 7 operating null r= 1/4fo>=1/4/; the usual figure of merit 
* * — es 
Xcostar(forfi)"fi + W0) P" can be computed without difficulty. Normalizing with 
respect to fo? and assuming the numerical relation 
X {1+exp[—42?7’o/7] cos4afir}. (38) o=0.15 fo one obtains the relation 
variance Hi 19 
fe (sensitivity)? fers? 
1 1 0.0555 Yvo\, (27)! 0.111 no 0.0555 
aera Gael eal GG) 1-1 
oo 2V2 (1+)! 1+x Yo 7 (1+2x)! 1+2x x 
o 1 0.0555 
2.46——— exp| - | 
(1+<)3 i+x (40) 
h tte 
where . cs 
r=—- 
of 


Note that Yyo/yWo-is the effective noise-to-signal ratio 
since Yo is the total noise power present in a band 
equivalent to that of the signal. The definition, of 
course, involves a certain degree of arbitrariness be- 
cause the signal power is distributed in a Gaussian 
manner whereas the noise spectrum is flat. 

Equation (40) may be used to compute optimum 
values of x for various noise levels. The process is a 
purely numerical one and the results are plotted in Figs. 
7, 8, and 9. Figure 7 shows plots of the figure of merit 
versus filter band width with the signal-to-noise ratio as 
a parameter. It will be observed that the minima shift 
to narrower filter band widths as the signal-to-noise ratio 
decreases. The required change in band width for opti- 
mum performance is not very drastic, however, if one 
excludes from consideration noise levels below 10 per- 
cent of the signal. Thus between pyo/~o=0.1 and 
¥va/Po=1.0 the ratio ¢/a; should change only by a 
factor (3)!. This information is readily obtained from 
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Fic. 7. Autocorrelator steady-state performance. 
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Fig. 8, which shows the proper filter adjustment as a 
function of noise-to-signal ratio. The use of linear scales 
emphasizes the relatively slight filter adjustments in the 
range normally of interest. 

In practice it will not be possible, of course, to adjust 
the filter band width for an optimum at every signal-to- 
noise ratio. One will probably choose the filter so as to 
optimize performance at the highest expected noise 
level since the tracking difficulties are most serious 
under that condition. The reduction in accuracy result- 
ing from this procedure is indicated in Fig. 9. Evidently 
the loss is serious only at very low noise levels, in a 
range unlikely to be of great practical importance. 

Figure 9 also suggests a rapid increase in tracking 
difficulty as the signal-to-noise ratio approaches unity. 
The fluctuation rises almost with the second power of 
Yvo/Wo in that neighborhood. 

The low frequency output spectrum of the frequency 
discriminator can be obtained directly from the first 
term of Eq. (13). Equation (34) is now used to represent 
the input spectrum, otherwise the procedure does not 
differ from that for the noise-free case. After consid- 
erable mathematical manipulation the result assumes 
the form 
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Fic. 8. Autocorrelator filter adjustment for optimum performance. 
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Assuming again a narrow Gaussian low-pass filter 
[Eq. (22)] and integrating over all positive f, one ob- 
tains the desired output variance 
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Fic. 9. Dependence of autocorrelator performance on 
noise-to-signal ratio. 
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The sensitivity to center frequency changes is clearly 
unaffected by the presence of wide-band noise so that 
the derivative of Eq. (30) remains applicable. The usual 
performance criterion variance/S* is now obtained by 
dividing the square of this sensitivity into (42) and 
normalizing with respect to fo. 
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Fic. 10. Discriminator steady-state performance 
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Fic. 11. Dependence of discriminator performance on 
noise-to-signal ratio. 


Equation (43) is analogous to (40) in the autocorrelator 
discussion. However, its interpretation is more difficult 
because two parameters, the filter band width o, and 
separation (/:—f2), may be varied independently. Thus 
a set of curves is required to describe completely the 
performance at any one signal-to-noise ratio. The prob- 
lem is simplified considerably by the practical consid- 
eration that all filter adjustments will have to be 
permanent. Performance can thus be optimized for any 
preassigned operating condition, such as the poorest 
signal-to-noise ratio expected, but once this choice has 
been made a decline in performance relative to the ideal 
value must be tolerated elsewhere. Selection of the 
proper operating point for this adjustment is a practical 
matter, determined largely by the anticipated duty 
cycle and disturbances. In the absence of quantitative 
information one tends to optimize performance under 
the worst possible conditions. This procedure will be 
adopted here and a 1:1 signal-to-noise ratio will be 
assumed.’ 

Even with this simplification, the selection of param- 
eters from Eq. (43) remains basically a numerical 
process, for differentiation with respect to either a; or 
(f:—f2) does not lead to an expression readily inter- 
preted in algebraic form. Rough calculations indicate 

7One may easily conceive of a situation where this technique 
would not lead to the best result. If, for instance, the poorest 
signal-to-noise ratio were to arise only for rare and brief intervals, 
one would probably gain by tolerating large errors during these 
periods in return for better performance at other times. In such 
a case one could always return to Eq. (43) in an effort to strike 


the most advantageous compromise by numerical computation, 
a tedious but straightforward process. 
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Fic. 12. Effect of filter separation on discriminator 
performance (os=¢). 


that the filter separation should approach the value 
providing maximum sensitivity as the noise level be- 
comes high. This condition is reached when the follow- 
ing relation is satisfied 


(fi-—fe) =2(P?+0/)'. (44) 


At lower noise levels the optimum spacing is some- 
what larger, but the increase does not become very 
significant until the signal-to-noise ratio exceeds 10. 
Thus one may safely conclude that the adjustment 
should be close to the figure demanded by Eq. (44). 
Preliminary calculations also indicate that the filter 
band width for high noise levels should be approxi- 
mately o/v2.' With these considerations in mind, the 
filter separation, the less critical of the two independent 
variables, was fixed at 


Substitution of (45) into (43) leads to a function of 
the single variable o; which may now be evaluated for 
various signal-to-noise ratios. The result is shown in 
Fig. 10. The independent variable «=o7/o/7 has been 
used to facilitate comparison with the autocorrelator 
(Fig. 7). 

Inspection of Figs. 7 and 10 leads to two conclusions: 


(1) If both systems are adjusted for optimum opera- 
tion at a given signal-to-noise ratio, their performance 
under that condition will be almost entirely equivalent. 

(2) The autocorrelator is more sensitive to adjust- 
ment than the frequency discriminator. The latter has 
characteristics essentially flat over a 10:1 range of x 
while the autocorrelator characteristics exhibit consid- 
erable variation over a similar range. If the system is 
subjected to operating conditions other than those 
assumed for adjustment purposes, the discriminator is 
therefore preferable. This effect may become quite pro- 
nounced, particularly if a change in the standard devia- 
tion o of the input power spectrum shifts the operating 
point far from its nominal value. Comparison of Figs. 7 
and 10 reveals that a 3:1 or 4:1 advantage in favor of 
the discriminator may easily arise in such a situation. 


8 Filter band width for maximum dc sensitivity is os=¢. 


Figure 11, like its counterpart 9, indicates the re- 
duction in discriminator accuracy resulting from the 
adjustment of parameters at a single assumed operating 
condition, here a 1:1 signal-to-noise ratio. As suggested 
in the above discussion, the loss is almost negligible 
throughout the region of practical interest. 

Finally in Fig. 12, an attempt is made to investigate 
the dependence of Fig. 10 on filter separation. It will 
be recalled that the separation was chosen on the basis 
of rough calculations for 1:1 signal-to-noise ratio. 
Figure 12 justifies the selection, for a reduction in 
spacing at that noise level leads only to lower sensi- 
tivity without reduction in the figure of merit whereas 
an increase results in immediate and marked deteriora- 
tion. There is no pronounced minimum in this curve 
in contrast to plots for higher signal-to-noise ratios. 
As pointed out earlier, some improvement can be 
secured by increasing filter separation as the noise level 
diminishes. Figure 12 shows, however, that the benefit 
obtainable by this procedure is quite limited even for 
signal-to-noise ratios as high as 10:1, while the accom- 
panying loss at higher noise levels is disproportionately 
larger. Operation near the point of maximum sensitivity 
therefore appears almost imperative unless the upper 
limit of signal-to-noise ratio is known accurately. 
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APPENDIX 
Derivation of the Figure of Merit 


Consider a particular instrumentation such as the 
discriminator of Fig. 4. Its average output é; is shown 
in Fig. 13, a curve which is approximately linear near 
fo=fm, the operating point in a null seeking system. 
If the discriminator input is a spectrum (Fig. 1), the 
instantaneous output e7(fo,f) will fluctuate about zero. 
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Fic. 13. Average discriminator output as a function of fo. 
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However, for each ¢ there will be a value of fo which 
satisfies e7(fo,/)=0 and it appears reasonable to speak 
of that value as the instantaneous measurement of fo. 
The quality of performance is then determined by the 
probability distribution of the instantaneous measure- 
ment; one may speak of good performance if the dis- 
tribution is narrow and poor performance if it is wide. 
The calculation of this distribution will now be carried 
out in fairly general terms. 

It will be assumed that the narrow-band input spec- 
trum is sufficiently narrow to be well defined but that 
its width is an appreciable percentage of fo (15 percent 
will be used in numerical calculation). If high accu- 
racies are to be obtained (better than 1 percent), the 
data must be smoothed for a rather long time, an opera- 
tion demanding that the pass band of the low-pass filter 
must be very narrow compared to the input spectrum 
width. Under these circumstances Rice" and others 
have demonstrated thdt the filter output assumes an 
essentially normal probability distribution, a circum- 
stance which, under certain reasonable assumptions, 
reduces the determination of the probability density 
for the instantaneous hull to a fairly straightforward 
computational problem. : 

Let the frequency sensitive element have an output 
G(x) which is normally distributed and whose long-time 
average has a null at x=». What is the probability 
density of the coordinate x of an instantaneous null of 
G(x) near x9? 

The computation is based on the following theorem 
due to Rice.” 

If y=F(a1,a9---;x) a function of x with the a’s 
random variables, the; probability of y having a zero 
with positive slope in the interval (*,x+dx) is given by 
the expression f 


dx J np O,n;x)dn, (46) 
0 


where p(£,n;x) is the two-dimensional probability den- 
sity function of the variables £=F(qa,,a2---;x) and 
n=0F/dx. For the problem at hand, é will be identified 
with G(x), » with dG/dx, and the a’s will be random 
parameters such as the random phase angles in Eq. (2). 
Under the assumptions made p(é,n;x) is normal in two 











dimensions: 
a ae ee 
n;x) =———— exp] — —)* 
PLEn eM |i p | M| n—h 
ee eS -*)| (47) 
2|M| — oS 


“Tf the spectrum becomes extremely narrow, the problem 
approximates that of the automatic frequency control circuit, 
solutions for which are well known. 

(1948) O. Rice, Bell System Tech. J. 23, 282 (1944) and 24, 46 

2 See reference 11, p. 51. 
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where |M| =uul22— M12", bn=(2)0—&, w= (1) w— 7, 
Hi2= (En) w— (€)(H). Substituting Eq. (47) into Eq. (46) 
and integrating, one obtains the approximate expres- 
sion for the probability density p(x) of a zero with 
positive slope at « 


~ 


ti ( =~ | “| (48) 
n> ——— nH) 
(2711)! Mul 2uu 


The approximation consists in extension of the range of 
integration to the interval (—*,), justified by the 
observation that the one-dimensional marginal distri- 
bution in 7 will be narrow if a sufficiently large low-pass 
filter is used, or in other words that the occurrence of 
a negative slope is unlikely near the point where a zero 
with positive slope is expected. 

If (G(x))w=£ has a null at x=, the function is 
conveniently represented by means of a Taylor expan- 
sion about that point. 


0 
£=G(x0)+ F (G(x) | (x— 2x0) =fo(x—xXo) (49) 
x 


r=1 
where 

0 

wv=| GO). 

Ox r=279. 

Similarly, 
oF 
I=tot+—| (x—2»). (50) 
X|\x=29 





Substituting Eqs. (49) and (50) into Eq. (48), one 








obtains 

1 Oj Miz 

p(x) =——-} fot | — ——Fjo |(x— x0) 
(2m)! OX|z=2 Mu 

(x— xX)? 
exp] — —j- (51) 
Mil 
iio 


If the system is to be accurate, (u1:/%0?)? must be 
very small compared to x. Then the exponential term 
differs from zero appreciably only for values of x close 
to xo, and one therefore feels that the term 


E 
Ox 
will be negligible compared to 7. Numerical computa- 
tion establishes easily that this is indeed the case for 
the order of magnitude of the parameters under dis- 
cussion. Under the same conditions yu, does not differ 


significantly from its value iio at xo. The distribution 
is then essentially normal and has the variance 


Mi2 
——jo | (x— x0) 
z= Hil 
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On the Distributions of Signals and Noise after Rectification and Filtering 
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The probability distributions for broad- and narrow-band signals and normal random noise, following a 
square-law rectifier and a video (or audio) filter of arbitrary width, are examined. The present approach is 
based on a method originally used by Kac and Seigert [J. Appl. Phys. 18, 383 (1949) ]] for noise alone (after 
quadratic rectification but no filter) in the case of the mth-order distribution (m >2). The procedure requires 
an appropriate transformation to express the output waveform in terms of the input disturbance; the 
statistics of the output are now determined by a suitable transformation with respect to the original, 
! normal statistics. Explicit solutions are obtained for an integral equation involving the autocorrelation func- 
tion of the noise, the weighting function of the video (or audio) filter, and an orthonormal set of eigen- 
| functions. For noise alone only the eigenvalues need be determined, but for a signal and noise it is necessary 
to know the eigenfunctions as well. Among the new results are (1), the calculation of the general, nth-order 
characteristic function for the filtered output following quadratic rectification for an input noise and a signal 
) and noise; (2), a discussion of the second-order probability density W2 in this case; (3), some examples of 
practical interest; and (4), the limiting cases of broad and narrow post-detection filters (vis-a-vis the 
predetection filter), with particular attention to approximations to W?2 for signal and noise for narrow videos 





and an improved approximation for W; for noise alone, as well. 





1. INTRODUCTION 


N most electronic systems we are concerned with the 
observation of some physical quantity. This quan- 
tity may be random, or it may have a _ well- 
defined representation in time and may be accom- 
panied by noise, which can interfere with the measure- 
ment. Here we consider as typical inputs to our elec- 
tronic system either normal random noise, or a normal 
random signal (the quantity of interest), or a combina- 
tion of normal random noise and a signal with a definite 
structure. It is assumed, in addition, that the random 
processes are stationary, and in some cases, ergodic. 
To obtain the desired information from the input, a 
generalized filter is used which may be active or passive 
and which may contain both linear and nonlinear filters. 
One such generalized filter is the combination of an IF 
amplifier followed by a quadratic detector, which in 
turn precedes a second, linear (“‘video”’) filter. Arrange- 
ments of this type are common in radar receivers, for 
example, and are also basic for power-spectrum ana- 
lyzers. In the present paper we attempt to determine 
the statistical properties of the video filter’s output, 
since from these statistics the necessary description of 
the observations can be given, and many of the ob- 
served properties of the original wave obtained.’~* 
A random process can be completely described by a 
set Wi, ---, W, of probability (density) functions.‘ 
Here W, is the mth-order probability density function 


1S. O. Rice, Bell System Tech. J. 23, 282 (1944); 24, 46 (1945). 
?:D. Middleton, Quart. Appl. Math. 5, 445 (1948); see also 
J. Appl. Phys. 22, 1143, 1153 (1951). 
ne Johnson, and Middleton, J. Appl. Phys. 23, 377 
1952). 
246) C. Wang and G. E. Uhlenbeck, Revs. Modern Phys. 17, 323 
1946). 


ability of finding an m-tuple of values of X in the ranges 
(X1,X1+dX;), «++, (Xn,Xn+dX,), at the successive 
times /;(=0), fo, ts, -- -, etc. Since the process is assumed 
stationary, only the time differences between pairs of 
values of X are significant. An important consequence 
of the normal structure of the input wave is that the 
set Wi, ---, W, is completely specified if the (co-) 
variances of the input wave are known. (Here the means 
are assumed equal to zero.) Thus, if the statistics of the 
input wave are given, it should be possible in principle 
to calculate the statistics anywhere else in the system. 

The problem considered here is the effect on the 
statistics of the normal process, when the incoming 
random wave is subjected to a linear filtering operation, 
represented by the IF amplifier (Fig. 1), rectified by a 
quadratic detector, and then passed again through a 
second linear filter, which we shall call the “video.” In 
particular, we wish the second-order probability density 
W >? for the output of this second linear filter-—an output 
which will not in general possess normal properties, 
because of the intervening nonlinear operation. We use 
here a generalization of a method first introduced by 
Kac and Siegert,’ who obtained the first-order density 
W, for noise and for a signal and noise. Before this, 
earlier work in the unfiltered (i.e., infinitely wide- 
video) case!” has given us W; and W; in a direct way, 
for all nonlinear elements. Kac and Siegert showed, 
however, for the first time, that explicit results in the 
much more difficult situation of arbitrary post-detection 
filtering could be obtained, at least for the full-wave 
quadratic rectifier. Their results were subsequently 
verified by Jastram® in certain instances. It turns out 


5M. Kac and A. J. F. Siegert, J. Appl. Phys. 18, 383 (1949). 

*P. Jastram, ‘The effect of nonlinearity and frequency dis- 
tortion on the amplitude distribution for stationary random 
processes,” doctoral dissertation, University of Michigan (1947). 
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that W, can be represented explicitly,’ provided the 
eigenvalues of a certain integral equation can be found. 
For signals and noise both the eigenvalues and the 
associated eigenfunctions of this integral equation are 
needed, and, moreover, only the characteristic function 
of the distribution is expressible in convenient form. 
Approximations for W,; have also been developed in the 
two extremes of video filters wide or narrow compared 
with the first, or IF response; see Fig. 1. 

The choice of a quadratic rectifier is important for 
several reasons: (1) In receiving systems which must 
reproduce adequately over extended dynamic ranges, 
the minimum (or threshold) signal, and therefore the 
noise, is confined to the lower portions of the rectifier 
characteristic, which for a wide class of practical de- 
tectors approximates a square-law® response. (2) Often 
one desires a quantity proportional to the instantaneous 
power in a random wave, and this requires usually a 
quadratic rectifier. (3) From a purely mathematical 
point of view, the quadratic rectifier is much easier to 
treat analytically than other nonlinear devices, and in 
fact, seems at the moment to be the only response for 
which more or less explicit solutions can be found. (4) 
Finally, a study of W2 (as well as W;) after rectification 
and filtering provides results useful in the problem of 
determining, say, the correlation function or spectrum 
of the output of a second nonlinear device, following the 
video filter. Methods to date have assumed that any 
filters between successive nonlinear operations are either 
very broad or very narrow, in which case standard 
procedures':? may be used. 

Kac and Siegert achieved their results in two ways. 
Their first approach, which we shall call the “expansion 
method,” develops the input random wave in a series 
using an appropriate set of orthonormal functions. 
However, this method does not appear possible in 
higher-order problems (W2, etc.), since the description 
by such a set is not general enough.’ The second, or 
“direct,” derivation employed by Kac and Siegert is the 


7 Recent work of R. C. Emerson, J. Appl. Phys. 24, 1168 (1953), 
discusses this problem from an alternative point of view, using the 
cumulants (semi-invariants) of the distribution to avoid solving 
the associated integral equations of Kac and Siegert’s and the 
present approach. 

8 J. H. Van Vleck and D. Middleton, J. Appl. Phys. 17, 940 
tioaay’ see also D. Middleton, Proc. Inst. Radio Engrs. 36, 1467 

1948). 

*M. A. Meyer, “On Some Distribution Functions in the Theory 
of Random Noise,” doctoral dissertation, Harvard University, 
June (1952), pp. 2-6, 2-7, and following. 


basis of our approach to the higher-order problems. 
This direct approach requires an appropriate trans- 
formation to express the output wave form in terms of 
the input. The statistics of the output are then deter- 
mined by suitable additional transformations with 
respect to the original, input statistics. With this 
method Kac and Siegert also obtained the mth-order 
characteristic function for the detector’s output in the 
case of noise alone. 

The present paper obtains the following new results: 


(1) The general mth-order characteristic function for 
the filtered output following quadratic rectification, in 
terms of the eigenvalues (and eigenfunctions) of a 
certain integral equation, for signal and noise, as well 
as noise alone. 

(2) The second-order characteristic function F,(¢1,¢2) 
as a special case of (1) for both broad- and narrow-band 
input waves. 

(3) A special video filter which gives the same second- 
order density W: as for an infinitely wide video, in the 
case of a band-limited IF response. 

(4) Approximations for W» in the limiting cases of 
wide-or narrow-videos for noise alone, and for a signal 
and noise. 


These results show, as expected, that the distribution 
tends to that obtained for an infinitely wide filter when 
the final filter is wide compared to the IF, and to a 
normal distribution,®* when the final filter is narrower 
than the IF. 

The main features of the analysis, which is necessarily 
quite involved, are outlined in the succeeding sections, 
along with a summary of special results of interest in 
the limiting cases. Sufficient detail is maintained to 
facilitate calculations of any additional cases, if so 
desired. 


2. THE CHARACTERISTIC FUNCTION AFTER 
RECTIFICATION. NOISE ALONE 


Our first major problem is to find the mth-order 
characteristic function ¢, for the output of the quad- 
ratic rectifier in terms of the statistics of the input IF 
wave. We next determine the transformed charac- 
teristic function for the output of a linear filter when 
the input has a given characteristic function. Applying 
these operations successively gives us the desired charac- 


% G. R. Arthur, J. Appl. Phys. 23, 1143 (1952). 
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teristic function Fy(f1,---,{,) for the output of the 
video filter when the input is the detected IF dis- 
turbance. 

Two forms of Fy are needed in limiting cases. One of 
these is expressed in terms of the eigenvalues and eigen- 
functions of the integral equation mentioned above; 
the other is associated with the Fredholm determinant 
of this integral equation. The quantity Fy is calculated 
initially in terms of the time-response (i.e., weighting) 
functions of the IF and video filters. We next transform 
these expressions of Fy into representations where the 
frequency-response (i.e., system) functions now appear. 
From the integral equation and the determinantal 
expansion in this new form we obtain finally the desired 
approximations and other results, since a “wide” or 
“narrow” video has meaning primarily in the frequency 
domain. 

We start with the mth-order normal distribution 
density and associated characteristic function of a 
random variable « which has a zero mean :”” 


W(x, °° ty) = (23)-* | yl? 


kl 


Xexp = (0/2) Fa , (2.1a) 
kl u 
o(f1,°° fn) = explé 2 Suste)) 
=exp(— (1/2) x Sefer), (2.1) 


where w is the matrix of the variances (x,«1), (24,21) =0; 
u* is the cofactor of the kth row, /th column of up, 
and uw itself is a symmetric matrix, since (x,%1)=(«1xx). 
Furthermore, we assume an ergodic process, so that the 
statistical averages (x,x1) are equivalent to the time 
averages (x.%1)w=R(t,—t), (k,/=1, ---, a), where the 
Ri. form a set of autocorrelation functions, and x1: --%p 
represent the random wave at m successive times 
(t:,--+,tn). By the theorem of Wiener and Khintchine"” 
we can write 


R(r)= y w(f) cos2rrdf, (2.2) 


where w(f) is the (power) spectral density of the random 
wave (in this case before it enters the IF filter). Thus, 
if we know the spectral density, we can completely 
specify the probability density (2.1) in this instance. 

When the input is passed through the IF, one easily 
shows that the correlation function and spectrum of 
the output are 


(Vit+t)Vit+h))=RG—t,) r 


-f " w( fre costef(e—t)df, (2.22) 


1H. Cramér, Mathematical Methods of Statistics (Princeton 
University Press, Princeton, (1946); see Chap. 10. 

"N. Wiener, Acta Math. 55, 117 (1930). 

2 A. Khintchine, Math. Ann. 109, 604 (1934). 


and 


w(f)rr=|B(f)|*w(f), (2.2b) 


where B(f) is the system function of the IF filter. In 
particular, if the input wave is white noise of constant 
spectral intensity D, then w(f)rr=D|B(f)|?; hence- 
forth V;(¢) represents the random input to the quad- 
ratic detector. 

We now distinguish between various outputs of the 
IF amplifier on the basis of their spectral character: 
when the power spectrum (measured, say, between 
half-intensity points) is narrow compared to the central 
or “resonant” frequency of the wave, it is called 
narrow-band; when this condition is not satisfied, we 
have broad-band noise. The former, for instance, is 
typical of white noise through a relatively high-O 
filter. It turns out analytically that it is more con- 
venient to discuss first these narrow-band cases, and 
then tospecialize the results to the broad-band situation. 
Accordingly, we begin by assuming a narrow-band wave 
possessing a symmetrical spectrum, centered at the 
frequency fo, as shown in Fig. 2. The wave V;,(#) 
can then be represented as a combination of slowly and 
rapidly varying components in the usual way'? 


V(t)=R(t) cosLwet+¥ (A) ] 
=2(t) coswol+y(t) sinwol, (2.3a) 


x(t)=Rceos¥; y(t)=Rsin¥, (2.3b) 


where R and W are, respectively, the envelope and 
phase of the wave. 

The quadratic detector through which V;(¢) is next 
passed yields the instantaneous output 


Vo()=BV (0), (— o<Vi<w ); (2.4a) 


= “Tet 9”) + (1/2) (a? cos2wot 
+ (2xy— yy?) sin2wot)] (2.4b) 


w(f) 








——_—_—_—» f 


fr 
° 


Fic. 2. Narrow-band noise spectrum. 
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for narrow-band waves. (For simplicity, in subsequent 
work we set §?=2 for narrow-band waves and §?= 1 for 
broad-band disturbances, the extension to other values 
being obvious.) The low-frequency output, with which 
we are concerned here, is at once 


Vo(t)rr=2?+y"; at /=1,;, 
(Vir) js=x(ts"+y(tj2=x7+y?, (2.5) 


this last for the m time points ¢; (j=1---m). But since 
V,(t) is normally distributed, so also are x and y,'? 
giving the moments 


(x 2)= f w(rrdf=(y7)* fw f—fued f=¥/s, 
; . "  =(Vy*),  (2.6a) 
{x jn) = (yin) = f w(f— fo)rr cosw(—t;)d f 


"7 y 
=-a(4.—t;)=—a,;, (2.6b) 
2 2 
(x5) = > (xRy;) =f w(f— fore sinw(t,—t;)df, (2.6c) 


this last since the spectrum is narrow-band. 

The nth-order characteristic function associated with 
the mth-order distribution density of the low-frequency 
output of the quadratic detector at m successive times is 


on (S1,°* + $n) = (expi ) s 5Vurlt+4)) 


j=l 


-{ vo f Was t05 94°90) 


| ex > i(%?+y/) ss *+d%ndy,++dyn, (2.7) 


7=1 
where W,, is the 2mth-order distribution obtained from 
(2.1) on letting 
Xe= Xk, (k, [= 1, sdesties! n); 
Xe = Vk (k, l=n+1, sid 2n) 


therein. In most instances we can assume a symmetrical 
pass band, so that («;yx), Eq. (2.6c), vanishes, where- 
upon W ,(%1,°* + ,%n3; ¥1,°**,¥n) factors into W_(x1- + -%,) 
XW .(y1°+*¥n), each term of which is precisely (2.1). 
With the notation of Kac and Siegert, viz., (2.6b): 


° u 
(x en) = (1/2) f w(f)rr cos(w—wo) (—t;)d f= >t 
we obtain finally for independent x and y 


be bur -ohd= Grrlaly| f- faced, 





n att 
xexp| — p » — itv) (2.8) 
cra \ ye 
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in which 6,; is the familiar Kronecker delta: 6,:;=1, 
k=l; =0, k+l. 

The next step is to integrate (2.8) by reducing the 
quadratic form to the sum of squares (see Appendix I). 
When this is done we obtain a result of Kac and 
Siegert®: 


on (Si, ~ *5n) =1"/ | Siabprjt+id.;| » Pkj=Akjy (2.9) 
where we have replaced a; by px; to get a more familiar 
nomenclature. The characteristic function @y is given 
in terms of the determinant | {,px;+74,;|, in which the 
statistical nature of the input noise enters only through 
the correlation coefficients , 


eeu J w(f— fare coset) f= 2a). 


—.2 


An alternative form of (2.9), which is more useful in 
the discussion to follow, is (see Appendix I): 


oro -Ba)= Tay | (2.10a) 
j=l 
(narrow-band) be 


v 2 SxPemf ike=jf im; (2.10b) 
=l 


ie. \; is the jth eigenvalue of the set of homogeneous 
linear equations given in (2.10b), and fj, are the com- 
penents of the jth eigenvector f;. Note that in this 
representation A; is a function of £1, £2, «++, &n. 

The broad-band case for noise alone follows at once 
from the observation that now the output of the de- 
tector (2.4a) can be written Vo(¢)=6?X?(t), where the 
moments are 


(X)X,)=vou= f w(f)rr cosw(t;—h)df; 
in (X?)=(X2)=y, etc. (2.11) 


Setting 8=1 here, and repeating the procedure outlined 
above for (2.7)-(2.9), we obtain just the square root 
of (2.9) and (2.10) for the characteristic function after 
rectification in this case: 


(broad-band) : on (f1,° + + fw) =4"!?/ | Cabprj+ tbe; |"? 


=] (i—ia)-¥2. (2.12) 


j=l 


3. CHARACTERISTIC FUNCTION AFTER 
RECTIFICATION. SIGNAL AND NOISE 


As in Sec. 2, we divide the discussion into two parts, 
considering first the more general analysis of the 
narrow-band case, where now a signal as well as noise 
constitutes the input to the IF filter. The noise part of 
the rectifier’s input is given as before by (2.3), while the 
signal is represented by M(t’) coswof. Here M(?’) is a 











W 


is 
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modulation of the carriert coswof, and ¢’ rather than / 
is used to indicate the fact that modulation and (IF) 
carrier are uncorrelated. There is, of course, no corre- 
lation between signal and noise either. The total input 
to the detector is therefore 


Vi()=[2(O+M (’) | coswolt+ y(t) sinwol, 


and from (2.5) the corresponding low-frequency output 
of the quadratic detector becomes (8=v2) 


Vir(t)=2+y?+2xM+M?. (3.2) 


Letting x(¢+/;)=x;, y(t+t;)=y;, M(’+1,)=M;(U’), we 
find that the characteristic function takes the more 
general form 


enae“tbe J rh f Walteig: ++ 20) Wain" ** Yn) 


(3.1) 


Xexpl id ciLaPe+y7+ 25M ;(V/) 
jut 


+A, (0)8D avs --dyn, (3.3) 
which is explicitly, see (2.8), 


bsen S15" + hn) = (W"e"| a) exp 3 ig ;M 7(’') 


? 


oo) - a al yyy 
f “ee. fay “* *dVn exp( > tve—> —) 

aly 

n oF yx, 


—.) 
te) n 

f vo dey --dey expt 7 >> &x2— - 
a kl mt pla 


+21 > o04,.Mi, (t’) ) . (3.4) 


k 





Again introducing a suitable principal-axis transfor- 
mation (see Appendix I) and integrating give us finally 


(narrow-band): ds4n (1° + Fn) =TI (1-7 


k=1 


TI exp] (5 M(O8sfes) / Air) | (3.5a) 


where the eigenvalues A;=A;(f1,---,fn) and eigen- 
functions f,; are defined by (2.10b), with the additional 
normalizing relation 


n 


DX Siferfji= Sur. 


l=1 


(3.5b) 
The corresponding expression for broad-band waves 
is easily found. We now let S(¢’) represent the signal 


t Since we are dealing with stationary waves, one can always 
choose the origin of time such that cos(wot’ +¢)=coswol. 
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and X(t) the noise, so that the output (2.4a) of the 
quadratic detector, when V;=X-+S is the input, 
becomes at the times ¢; (j=1, ---, 2) 


V 5=LX5+-Si¢/) P=XP7+-S27(0)4+-2X5S;(), 
(8=1). (3.6) 


Comparison with (3.2) and (3.4) gives at once the 
desired characteristic function 


(broad-band): $s4n($1,° ++ $n) = TI(—ay)-” 


k=1] 


Teal: X Silthtsfes) / 0-29}. (3.7) 


k=1 


Both (3.5a) and (3.7) are generalizations of Kac and 
Siegert’s earlier result (2.12) for noise alone. 


4. THE POST-DETECTION FILTER 


There remains now to determine the effects of the 
linear, post-detection (‘‘video’’) filter on the statistics 
of the rectified wave. Our first task is to obtain the 
desired characteristic function after this post-detection 
filter for the broad and narrow-band cases discussed 
above. Special limiting cases of interest are reserved for 
Secs. (5-7) and Appendix II. 

If the input to the “video” filter is Va(¢) where Va(?) 
is Vzr(t), for broad- or narrow-band inputs, and if 
Voa(t) is the corresponding output of the filter, we can 
write in the steady state 


Vult)= f A(r)Va(t—1)dr, (4.1) 


where A(r) is the filter’s weighting function, which is 
related to the system function Y(iw) by the Fourier 
transforms 


Ao=f Y (iw)e'df, Vtis)= fo A(ije~i#'dt (4.2) 


oo 


for linear stable filters. For all real filters A (¢) =0, 
t<0-. 

The procedure now is to replace the integral (4.1) by 
a sum which (in the appropriate limit) will again be 
equal to the integral. The sum is here 


L 
(Voa)e=Vi= dX Ar(Va) av, 
l= 


and 


L 
(Vod)k—m¢5) =2. Ai(Va)e—t-mi), (4.3) 
= 


where the interval over which A(r,) is significant is 
divided into L+-1 points, each of which is a distance 
h apart, and for each (k—-1)h< 7. <kh; A(ri)=Anh. 
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The mnth-order characteristic functions Fy (¢1',---,¢n’) 
for the output Voa(?) of the post-detection filter is 


Fv(t)=Cexp|i 5Vut—n)}). (4.4) 


Substituting for Voa(/—7;), etc., from (4.3) into (4.4) 
and performing the statistical average gives 


Fy (t= f se fWLV da Wen] 


n L 
exp} , x CK , A (Va) (e—1—-m(k—-1)) 
1=0 


k=1 
Xd(Va)o---d(Va)_1, (4.5) 


where [m(j)—1]kh<1;<m(j)h. Here Wz is the 
(L+1)st-order probability density for the +1 inputs 
to the filter at the successive times /— 71, to /. (According 
to the present notation m(0)=0.) 

Our next step is to replacet the sum 


L * @& 
D Ai(Va)a-ima) by YL Army (Va)et 
l=0 l= 


Equation (4.5) is now written with the help of the 
Fourier transform of W,, viz., the characteristic func- 


tion (fo, - +1), 


Fv(t)= f - - faVae --d(Va)_1 


n L 
xexp| is’ DA man Vai] 
1=0 


k=1 


. (on) f eo fae ° -dt_1(£o,° os i_L) 
xexp| —i 3 (Va) . (4.6) 


The characteristic function $(fo,---,f~1) is associated 
with the Z+1 inputs (Va)o, ---, (Va)-x. Integrating 
over (Va), (q=0, ---, Z) we find the first set of 
integrals to be 


L n 
ps i(r.-E oA man) (4.6a) 
1=0 k=1 


t This substitution may be justified by the following argument: 
if we let /+-m(j)=q, then 


& L+m(j) 
2 Ai(Va)e-t-mip= 2 A cq—mciy) (Via) ea 
I-0 m(j) 


L+m(j) 
oat “J 


= 2 Ag—m¢i)(Va)e-g, 
0 


since A(r;)=0; 7;<0. Thus, if Z is chosen sufficiently large, so 
that A (1~m j))—0, then A,-0 also, and we can replace the upper 
limit on the last sum with L only. 


MEYER AND D. 





MIDDLETON 


so that (4.6) becomes finally 


Pe@)=0( $4'A—mie—1) 5° °° 5 
k=l 


n 


L 
LX fe’ A j—-mee-)3 °° Pe o%'A imu), (4.7) 
=! 


k=1 


where we remember that ¢ is here the characteristic 
function for the output of the quadratic rectifier, and 
Fy(@’) is the desired characteristic function of the 
output of the rectifier plus post-detection filter. A 
number of specific cases follow: . 


(a) Narrow-Band Noise 


We apply (4.7) now, using the results of Secs. 2 and 
3. Equations (2.9) and (4.7) give us first one form of 
the mth-order characteristic function for narrow-band 
noise after video filtering: 


Pre w=ie [NE fi Aum ) +b) 
q=l | 
(k=1, -e+, D), (4.8) 


and the determinant is an (L XL) one. Corresponding 
to (2.10) we have the alternative representation 


L 
Fy(@)v=[T G—-aysy, (4.9a) 


where 


L n 
E vinm( $An-me-n ) fn ifim (4.9b) 


q=l 


and since A,;=A (t,)h this sum can be rewritten 


L n 
2 (Hpimf jk) Do Sq A (te bm (g—1)) {i Ajfim. (4.9c) 


q=l 


Letting L approach © and hk approach 0 we get an 
integral for the sum over k, and so for the characteristic 
function of the continuous, filtered wave Voa(t), Eq. 
(4.1), the result (4.92) becomes in the limit (dropping 
the primes on {;, etc.) 


Fy(Qyv= ne —#r;), 


j=! 


(4.10a) 


where A; are the eigenvalues (here functions of {1, «++, &n 
and 71, «++, Tn-1) of the integral equation$ 


Af a E tel (tre) ptt) (=). (4.10b) 


0 q=l 


§ The eigenvalues of the finite sum are certainly discrete, but 
it may be questioned whether or not they remain discrete in the 
limit of the integral equation, since one can always find kernels 
for which this is not the case. However, for symmetric kernels 





In t 
if (4 


=~ 


For 
teri: 


Let 
mo 


and 


wh 


wil 


In 








—- eS aS 





DISTRIBUTIONS OF SIGNALS AND NOISE 1043 


In the limit, of course, one gets an infinite determinant, 
if (4.8) is used. 

The examples of chief interest occur for n=1, 2. 
For n=1 (first-order probability density and charac- 
teristic function) we find that (4.10b) reduces to 


vs f at (Dp(ts) f(D=AF()=AG) f(8). (4-11) 


Letting \;=A,/f1¥, we see that the integral equation is 
modified to 


f A (t)p(t,s) f(Qdt=' f(s), (4.11b) 


bal 


and the desired characteristic function becomes 


Fy(t1)v=TL(1—ityas)>, (4.12) 


j=l 


which is the result obtained by Kac and Siegert’ when 
y=1. 
When n=2, we get from (4.10a, b) 


Fy(tates7)v=TIL afta) (4.13a) 
j=l 


with the accompanying integral equation 


y J dil $A (+624 (t—1) Jolt,s) f(=Af(s). (4.130) 


K(s,t), which are quadratically integrable in the sense 


f ‘ f K*(s,t)dsdt= A (exists), (i) 


the proof of discreteness is easily given. We summarize the main 


points of the argument: let the eigenfunctions and eigenvalues be 
defined from 


J, KsyiWat=riy4(s). (i) 


Since we are looking for quadratically integrable solutions of (ii), 
we can choose these from an orthonormal set. Now the Fourier 
coefficients of K(s,t), with respect to y;(t), are given by A;y;(s). 
Bessel’s inequality yields (for arbitrary n) 


2 dtvi(s)< f- Rnd. (iii) 


Integrating over s we obtain 


zats< f ‘ ij K*(s,t)dsdt= A (exists) (iv) 


from (i). Therefore 2 A; converges, and the A,’s can have no 
inl 


finite point of accumulation. The spectrum is accordingly discrete. 
In our examples the quadratic integrability may be shown from 
the fact that /_..”| A (t)|dt exists, and the fact that p(s,t)=p(t,s) 
=p(|t—s|) never has a weaker falling-off for large (¢—s) than 
|(t—s)|?-e-l*-“, where P is finite (>0). The former applies 
because we are dealing with linear, stable filters, and the latter, 
because p represents the autocorrelation function of intensity- 
limited noise after it has been passed through such filters. We are 
indebted to Professor Ivar Stakgold for calling to our attention 
this question of the discrete character of the eigenvalues. 


The alternative form (4.8) proves useful in limiting 
cases of wide and narrow video. If we let 


A= |b je +-hK jx| 


be the Lth-order determinant (4.8), where A,=A (t,)h, 
we may expand A as follows” 


L PL Kis Kin 
A=1-y > KjA+—-D0 h? adie (4.14) 
7 2 ik kj Kuz 








In the limits M—«, h—-0, sums are replaced by in- 
tegrals, as before, y=i, and we get finally 


Pe(ttsitw= {1-1 f K (t,t) ,dt— (1/2!) 
0 


0 
where 


K (ti,t2),=Wp(te—h) [1A (t:)+2A (t;— r) |. (4.15b) 


Applications of this approach are considered in Secs. 
5 and 6. 

We note finally that if the video filter’s weighting 
function is such that A(t) >0, all />0, then only posi- 
tive eigenvalues can occur, corresponding physically 
to the fact that the output of such a filter is always 
positive (when the preceding detector is quadratic). 
Consequently, there will be a dc term, appearing as a 
finite average value for the output, i.e., (Vour)>0, as 
well as an ac fluctuation about this value, but no 
{Vout)<0. A simple example arises when the video 
filter is an RC network, with the output taken across 
the condenser, as then A (#)= (RC)“e—/®°, t>0. How- 
ever, when A(#) can be negative for some /(>0), a 
[finite] number of negative eigenvalues appear as 
well: the output can be negative part of the time, 
although there can still be a positive mean value (V) 
(which vanishes in the limit of a very narrow video). 
An example of this is provided by a CR post-detection 
filter, with the output observed now across the re- 
sistance, as here 


A (t)=8(t—0)— (RC)e-"F",  (t>0-). 


These phenomena are specifically illustrated by the 
results of Secs. 5, 6, see (5.3), (5.4), and (5.22), etc., 
and the foregoing remarks apply also for (b)—(d) 
following. 


K (ti,ti)+ K (ti,te)+ 
K (te,ti)+ K (te,te) 








-1 
atdtet +} , (4.15a) 


(b) Broad-Band Noise 


From (2.11), (2.12) it is clear that the results of Sec. 4 
may be used directly for broad-band problems, with the 


13 Lovitt, Linear Integral Equations (McGraw-Hill Book Com- 
pany, Inc., New York, 1924), p. 24. 

§] This number depends on the kernel and on the weighting 
function A(t), of course, but is finite for realizable networks. 
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simple modifications of Eqs. (4.8), (4.10), (4.13a), etc., 
i.e. of replacing the right-hand members by their 
square roots. 


(c) Narrow-Band Signal and Noise 


A direct modification of (3.5a) in conjunction with 
the transformation used to replace ¢_;’ in obtaining 
(4.7) permits us to write ¢;(=f_,;)=f1'A;, (j=0, ---, 
[L—) for the first-order characteristic function where 
there is a signal. Dropping the prime on ¢;’ gives us 


Fy(t:)sev=]L1— apt e) 
k=1 


xexp| ite f M('+0)A(0) so(pdt) / 


(1—it wrx’) ; (4.16) 


where Ax= xf: as in (4.11a, b). The associated integral 
equation and normalization of the eigenfunctions f? 
follow, respectively, from 


, 


f A (t)p(t,s) f (Odt=r,’ f® (s) ; 


—o 


of A(t) f™ (0P°dt=1. 


—o 


(4.17) 


From (4.16) it is clear that in dealing with a signal, as 
well as noise, one must also know the eigenfunctions; 
it is not enough to determine the eigenvalues alone, as 
is true of the case for noise only. 

The second-order characteristic function is obtained 
in the same way. The general transformation relating 
¢_; and ¢,’ now reduces to the two terms ¢;={,'A; 
+£2'A jm (7=0, ---, L-). Dropping primes again 
we get finally ; 


Fy (o1,625;7) stn = Tha —™x)" 


xexp if f M (t'+2[61A ()+£2A (t—1) J 
-s(pdt) /a-ino}, (4.18) 


where we remember that now Ax,=Ax(1,¢2;7). The asso- 
ciated integral equation and normalization condition 
on f* are 


y f [od (4 EeA (t— 2) ols) ( Odt=ruf(5), 
_ (4.19a) 


f [oA (t) +624 (t— 7) ]f (%dt= 1. (4.19b) 
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A check on these results is given in Appendix II in the 
special case of an infinitely wide video. 


(d) Broad-Band Signal and Noise 


The expressions (4.16)—(4.19) may be adapted at 
once to this case if we replace M (t/+/) by S(t’+2) and 
TI.(1—iA,) ~by Tk(1—7id,)-"*. [See Eq. (3.7) for 
infinitely wide video filter; also Appendix IT. ] 


5. THE NARROW VIDEO FILTER 
(a) Noise Alone 


If we consider now the determinantal relation (4.15a) 
and expand it in such a form as to produce the coef- 
ficients of the various powers of {)%2*"*, £2/%7"*, 
(j=1---n; k=0---7) we find that these coefficients 
consist of a sum of terms of order (Af,/Afr), (Af,/Afr)?, 
+++, (Af,/Afr)*. When the video band width Af, is 
small compared to the IF band width|] Afr a good 
approximation may be obtained by keeping terms up 
to and including (Af,/Afr)?. Letting n— © and picking 
out the significant terms to the desired order, we find 
finally that 


v 
Fry (61,6257) w= ef G82) | 1+ P (¢P+527)9q(0) 


ip* 
+ 2&1f2q(7) J+ ee (F2+605) pO) +36 °F 2p (7) 


l 


+3515? p(— 7) J+ 0(Af,/Afr)®} , (5.1) 
where 
p(r= ff fr@) Y (ten’) V (ten’ + iw) * 
XA )H (SPH f")e 
-dfdf'df” =O(Af,/Afr)’, (5.2a) 


a= f J H(f)H(f’)| ¥ (w+ io’) |2e't#"d fd f’ 


=O(Af,/Afr). (5.2b) 

Our next step is to express (5.1) in terms of an ex- 
ponential with linear and quadratic arguments, the 
higher-order terms appearing as a factor of the expo- 
nential. Inversion then gives the (asymptotic) expres- 
sion for the second-order probability density for narrow 
videos (vis-a-vis the IF): 


|| These band widths are those of the equivalent rectangular 
filters. For details of the method, see reference 8, Chap. V. 
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ae at 
+ (sia arava) | 
aV3aV2 aV,aV:* 100)a(r) 


+0(4f4/Afo%) }Ia(Vi Vestn (5.3) 





where (W2)y,o is the distribution density defined by 
W2(Vi,V 237), o= L2myq(0) (1— po?) 2 J 
( | (Vi-v)*+ (Ve—-)?—- 2(Vi-) nl ) 
Xexp| — , 
2¥q (0) (1— po?) 





(5.4) 

in which 

pa(r)=e(7)/,(0)= f A(t)A (t+ nat | f A*(t)dt. 
a ™ 5,5) 


[See Appendix III, Eq. (A3.1).] 

For broad-band noise one may use (5.3) after inserting 
a factor } before each term in the brackets [ ] and 
replacing y by ¥/2, ¥ by y¥7/2 in (5.4); y itself is then 
reinterpreted according to the remarks following refer- 
ence 14, 

The desired development of the characteristic func- 
tion and probability density may also be obtained 
directly from (4.13a): for the narrow-band case we 
have formally for small \; 


Fy Suks7)v=TIL1—aGi,g3;7) = a4 


j=1 


~exp| -¥ log(1—) 


=ep(iEa-a/nEart-). 66 


These sums of the eigenvalues are found in the usual 
way by expressing the symmetrical kernel (4.15b) as a 
series in the orthonormal set of eigenfunctions f(t) 


K (tte) = 5 Af (A) &) 


and integrating over ¢, and #2. The sums in (5.6) are 
easily seen to be the trace of the kernel and of the 
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first iterated kernel, respectively, which in the time and 
frequency domains become 


f K (t,t) ,dt= f KL af=L =WErtt9), (5.7) 
rs K (t,l2)-K (te,t1) edtydte 


. f f RUA Adfid =r 


j=1 


= (SP+52)¥¢ (0)+ 2oifa*q(r), (5.8) 


and so on for the second and higher iterations. The fre- 
quency form of K is 


K( fi, fo) = (Sit feet?) (fi) V (tor —tw2). (5.9) 


The correspondence between these results and (5.1)- 
(5.2b) indicates that the \; are indeed small for the 
narrow video (5.2), so that this latter approach may 
be followed for the corresponding problem of broad-band 
noise and signal and noise (considered below). 


(b) Signal and Noise 


Here the governing integral equation and the charac- 
teristic function for the second-order case (n=2) are 
given by (4.19), (4.18) for narrow-band waves. As 
mentioned above, when the video is narrow compared 
to the IF, the eigenvalues are small and the product 
[].-1%(1—7,)~ (for noise alone) approaches a normal 
characteristic function. Using an expansion similar to 
(5.6) we have for the exponent in (4.18) 


exp{i y —" y dets?— iy? x ABor+: ++}, (5.10a) 

where 
‘ai f ” MUFOL EA Df 1-1 (a)at. (8.10b) 
Expanding M(t’+-#) in the orthonormal set f() gives 
M(+)=¥ bf (0), (5.11) 


and from (5.10b) and the integral equation (4.19a) one 
sees that b.=c,. Then, multiplying (5.11) by M(t’+2) 
X[¢1A ()+¢2A (t— 17) ] and integrating gives us an ex- 
pression for the first term in the exponent of (5.10a) : 


J M?(t'+ [tA (t)+f2A (t— rte Eo Cx’. (5.12) 
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The other term in (5.10a) may be found by a similar 
technique: Expansion of p(s,t) gives 


a(s=¥ dy so= Af (s)f(W), (5.13) 


and multiplying by 


M (t+t')M (s+?) 
X [6A (+624 (t— 17) JIA (8) +24 (s— 7) ] 


and integrating over s,t yields 


f f p(s,t)M (t+0')M (s+U’){t2A (s)A (0) 
+§A(s—1)A(t— r)+2it2A (s)A(t—1)}dsdt 
~ Arce’. (5.14) 


For the third term in (5.10a) we multiply (5.13) by 
[¢1A (t)+£24 (t—7)]Jo(x,7) and with the aid of the 
integral equation (4.19a) obtain 


f [tA (+ 52d (t—1) Joes) ( sat 


=EA2f (x) f(s). (5.15) 


Multiplying this by the same expression used in de- 
riving (5.14), replacing ¢ by x, we obtain finally 


ff fes@+nsc-onta@tia—o) 
X{h1A (O+ 624 (t—7)} M(t’ +5) M (+2) 


< p(s,t)p(x,t)dxdsdi=>> dj2c,?_ (5.16) 
1 


in the general case. 

Because of spatial limitations we now outline only 
the derivation of the leading term in W2; the correction 
terms may be found in straightforward fashion with 
the help of (5.16) (expressed in frequency form), com- 





W2( Vi, V9;7) S+N,0>= [2m (WHi+ 2~M?H>)®;(0) (1 — po’) a 
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bined with (5.1), and then developed into a structure 
of the type (5.3), where, of course, (W2)w,o is replaced 
by (W2)s+n,o0 derived below. A further simplification 
(which does not in any way restrict the generality of 
the method) sets the signal equal to a constant: 
M (t')=M. Then (5.12) becomes 


J 


D oe? =MP(S1+$2), (5.17) 
k=l 


and the integrals occurring in (5.14) can be expressed 
in the frequency domain as 


[fa (s—h)A (t— 72)p(s,t)dsdt 
= f H(f) | Y )iw) |2eiw (rita) f 


nef rapes 


= H(0)®,(71— 72) = H(0)®;(72— 71), (5.18) 


since for the video narrow compared to the IF, 
H(f)=H(0). We get finally 
De Ancx? = M7H (0){ (F°-+527)P1 (0) + 2616241 (7)}. (5.19) 


k=1 


The characteristic function is the product of (5.10a) 
and (5.1), which for the normal term becomes on 
writing H(0)= Hp, 


Fy (61,6257) sw, 0 
= expli(¥t+ M*) (61+ $2)— (1/2){ (WHi+ 2pM?Ho) 
(82+ 52?) (0)+ 261681(7))} J, (5.20) 


where we have made the substitution for g(r): 
+00 
a=) [ PdfeHa(r), 6.21) 


which is valid for narrow videos. The corresponding 
probability density is 





- & (Y+M*) P+ LV2— W+M?) P—20Vi— Y+M") LV2— (¥+-M?) |po 
¥ (WHi+ 2~M"?Ho)®1(0) (1— po?) 


where po is given by (5.5). 


(Not that Hi= f ° p(r)2dt= f 7 H(s%4f). 


—o 2 


We observe in both (a) and (b) here, especially (5.3), 
(5.4), and (5.22) that in this approximation (q¢(0)+0, 


} oa 





®,(0)+0) the probability densities appear to exist for 
Vi, V2<0. Of course, this cannot strictly occur if 
A(t)>0, t>0, since there can be no negative output 
of the final video filter in such cases; (see remarks at 
end of (a), Sec. 4). The higher-order terms in these 
approximations tend to reduce this contribution for Vi, 
V2<0 to zero. 
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6. THE WIDE VIDEO FILTER 
(a) Narrow-Band Noise 


The case when the video band width is narrow com- 
pared to that of the IF stage has just been treated. We 
can also obtain expressions for the alternative limiting 
case where the video band width is large compared to 
the IF band width. The expected result should approach 
that of the distribution obtained with an infinitely 
wide, uniform video response. 

If the IF band is limited (practically speaking) to 
a band** 2B, and the video band width is 6, one makes 
a Taylor series expansion 





ae lw wo \? 
¥ (is) =1+ar—taa{ ) a 
2rB 2rB 


6.1 
ay~0(B/b), ( 


a2~0(B/b)? 
which certainly converges for w<27b. 

Evaluating the coefficients in the determinantal ex- 
pansion of Fy (fife7), see (4.15), we obtain 


D: =f f aon 


X (w+ exw’ +-w"*) peor faf’ 


+2f f 
b Mb 


retaining terms through the second order. The con- 
tribution of the second integral is negligible, since H (/) 
falls off at least as (1/w*) and Y (iw) at least as (1/w) for 
w>b/2r. Also, if H(f) falls off fast enough, the finite 
limits in the first integral may be replaced with neg- 
ligible error by infinite limits. The latter step is not 
necessary, but is convenient for some cases. 

Also we find [see (A.1) and (A3.9)] that 


+ 2a2) 
(20)*B* 





Y (iw iw’) |2e+~*d fdf, (6.2) 


D3(1,1)= J i J H(f")H('+f")H—f) 


aa = (w+ ww’) tag 
é [i+ (a,-+ 2a2)— (2n)B ~ jasas df P (6.3) 


** This is true in many practical networks. In a stagger-tuned 


amplifier of m stages, for example, the normalized frequency 
response is 


1 ; 

V (iw)| = licwar| 
| ¥ Gu)| 1+ w/w)” 
The power, proportional to | Y (tw) |? is } at o=w», and is (1+22")-! 
=2>2" at w=4w». Thus for a 5-stage amplifier the ratio of power 
at w=u, tow=4w,) is 512-1. - 
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which produces the desired approximation for the 
characteristic function 
ay+ =) 


wy 
SPEEA ova (++ F210) 





Fe(bitsir)n&| Fr (uty | ot (= 


aa 
x 
2 





0) 
+i°(a+s'@-"—) rer. 


1(0) s 
+i-(a0+s(--"—)rse| | , 64) 


where 
~a[(r)] 
1(7) =————————_, 6.5 
qi(7) On)tor* (6.5) 
1 2 es) 
s=(— \ Sf fovrne \H(f+f") 
; XH(f—f"e-“"dfaf'af”, (6.6) 
sien f H(fye-*"df; (6.7) 


—o 


and Fy({1,2;7)w|.~ is the characteristic function for 
infinite video when noise alone enters the quadratic 
detector. We get finally 


Fy(O1b037) w=F v(fike37) w | o 


(2a2+a1*)yv* 
x|1- Fry bit) v 





OG. i} (6.8) 


where specifically 


qi(O) 
(61,62) = [Porte titial+= (62-+$2°)q (0) 











qi(0) 
+ivfae)ts'()— = fev 
i(0) 
+i] aos fx] 6.9) 
Thus 
2a2+a’ 
WalV iV ait) = Wel VV ain) xa ( B ) 


id 10 . 

xo(—, — W2 (Vi,V 257) N, cs (6.10) 
OV; OV. 

where W2(Vi,V2;r)n,~ is the distribution for the in- 

finitely wide video, see (A2.16), and W.“? is obtained 
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from the convolution 


WV Vite f feos. 


xXW2(Vi- (6.11) 


2,V2—YV37) v, d2dy. 


(b) Wide-Band Noise 


The results for wide-band noise follow easily from 
the work above. We find 


1 /2a2+ 10° 
Fe (bibs wP (Gsb0). of 1 ( ) 
2 B? 





XWVF v (05,625 7), (F102) + - | (6.12) 


Here 6(¢1,¢2) is given by (6.9) and the Fourier transform 
of Fy (¢1,€2;7) w, 2 is the second-order probability density 
W2'(V1,V2,7)n|. obtained by the square-law rectifica- 
tion of wide-band noise. We'(V1,V2;7)n|.. can be ob- 
tained directly from the normal distribution, that is, 
from (A2.18). The result then is 


VY /2a2+ar 
W.’ ( Vi, V2;7) N= [was V5;7) Nie —( ) 
2 B 





ts) te) 
x0(i—, i— Yara» (susir) (6.13) 
Ox, OAXe 


where W.“)(V,,V2,7) is given here by 


ff W: aattiensait ‘(x- 


-—@ ~@ 


X2— 37) Nv, edzdy. (6.14) 


(c) First-Order Approximation (Narrow- 
Band Noise) 


As an example, we observe that it is possible to get 
approximations to W,(V)y for narrow-band noise by 
solving the integral equation for the first-order case, 
vV12., 


J Y (ies — eos) H(f:)g (fod f= dag®(f). (A3.11) 


The present method differs from that used by Kac 
and Siegert in that it yields any number of eigenvalues, 
depending on how closely one approximates the video 
filter by a Taylor series. Certain limitations on the 
types of filter for which approximations to W,(V) can be 
found by the method of Kac and Siegert are removed. 

Using the Taylor series expansion (6.1) and substi- 
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tuting into the integral equation above, we obtain 


—we) a2(wi- 
2rB (2x B)? 





we)? 
Jemunar, 
nol Y (tw1—iwe)g™ (fo)d fe 


b 
+H(f) f ¥ (ieo—iws)g (fod fo= dag (f). (6.15) 


Expanding and neglecting the last two integrals, we 
find thatTT 


aol f g (fo +4 wasted ar 
: 2 2rB (2xB)* 


ta, aw) 
ws (k) fe Li a d 
+f 26 | 2rB (2xB)* 1 Js 


2 i wo?g*? WG 











= if.|=ng(P. (6.16) 
7B)? 


Solving the integral equation, we obtain finally these 
important eigenvalues: 


were (=) f° PH(f)df,  (6.17a) 


which is the same result for \; as obtained by Kac and 
Siegert.[{f We have also 


ta petee 
“Np J (ny? 


H(f)df- ftH(f)d 
I PH Hf (f) il. 


Using Eqs. (4.46), (4.47) of reference 5 and the fore- 
going, we may write approximately 





(6.17b) 


and 





(6.17c) 





waV)v=| _ “| exp[—V(1—a)/¥] 
a. (—V/y), V<0, (6.18) 
—— exp——— exp(—V » FB, ; 
y ob of 


where 1—\,;=\2=—a, As=7; (a>0, y<0) and all 
terms of 0(B/b)? are included. In this case V> |a|y, as 
a—0, and we obtain on noting that | y|<«(a) the further 


tt The infinite limits apply in the same sense as the cases con- 
sidered in Eqs. (6.2) and (6. 3). 

tt Note that for certain functions H(/) we cannot integrate | 
between infinite limits, but must use —b and +5. 
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approximation 


ev V 
wV)»*——| 120+ 0} (V>0). (6.19) 
y y 


(d) A First-Order Approximation. Narrow-Band Signal 
and Noise 


A further example in the case of wide video filters is 
provided when the input is the sum of signal and noise ; 
the main features of the approximation are outlined 
below. 

We start with (4.16) and the integral equation (4.17). 
Our approximation is limited to the calculation of a 
single eigenvalue (A;), and a single eigenfunction f“ (¢). 
Calling 


( fo morna (Of (at) =H /e, 


and writing \;=1+a, viz. (6.18), we see that Eq. (4.16) 
becomes 


Fy) s¢v = (1-Hii—aif) 
Xexplit wk?/(1—itwW—aigw)]. (6.20) 
Making the approximation for small a 
(1-1 — aif) = (1— fp)“ + aif / (1— ify)? 


and expanding the corresponding term (containing a) 
in the exponent of (6.20), we obtain finally 


Fy(¢1)s+.=—— exp| ——— 
(1—if1) 1-w 


xX [i+ taf WF y (¢) S4N, oop SPR Fy" (¢) S+N, m | (6.21) 


where we have retained terms through O(a) only. The 
Fourier transform of (6.21) is easily found, see (A2.1), 
(A2.2), and we can write finally 


1 it Wr ) 


0 
Wi(V) sev FWi(V) sex| ee (V) sty 
1 


x 
+ rr howe (6.22) 
OV? 


to this order of approximation; Wi(V) sn]. is given by 
(A2.2), and 


WV) six f e7/VW1(V—x) sin|dx/y, 


W,°(V) sve f e¥W 1) (V—x) sindx/p. 
7. A SPECIAL VIDEO FILTER 


If the IF noise and signal wave are confined to a band 
2f, this implies that the slowly varying part has a 


maximum frequency spread fy. A square-law detector 
doubles this spread to 2/,, so that the maximum band 
that the detected wave occupies is 2 f,. Thus, if the video 
filter is flat over this band, one would expect the same 
result as for an infinitely wide filter. The expected 
result can be shown as follows: 


We choose 
VY (iw)=e-', O<|f| <2fr, 
f<-fe 
H(f)=0 7.1 
(f) i ' (7.1) 


The condition H(f)=0 at frequencies greater than f/f, 
from the center of the IF pass band is often practically 
met, as for example in a stagger-tuned amplifier with 
many stages. 

Using the frequency form of the integral equation 
(A3.11), choosing 


g(f)=LkiS iH (f)+ kof 2c (f) eine (7.2) 


and substituting this into the integral equation, we 
obtain finally 


Loaf Fadi t kof 20 (7)} + S2e!7{ kik ip (7) + kof 2} J 
=)(Rigit Rofee-"). (7.3) 
This relation can be satisfied if we set 
(Si A) i+ S2k2p(7) =, (7.4a) 
§1kip(7)+ke(f2—A) =0, (7.4b) 


which leads to the same values of \ as found in Appendix 
II (A2.9), for the case of an infinitely wide video. 


8. CONCLUDING REMARKS 


The preceding analysis has shown that the second- 
and higher-order distribution functions for noise or 
signal and noise which have been rectified by a quad- 
ratic detector and then filtered can be represented only 
in closed form in terms of the characteristic functions 
of these distributions. Even so, such representations 
imply the solution of a difficult integra! equation. 

In many instances of practical importance, we are 
interested in limiting cases, i.e., either a very wide or a 
very narrow video. For wide video, by expanding the 
characteristic function we can obtain approximations to 
the second-order distribution function, with correction 
terms obtained from the known distribution function 
for infinitely wide video response. As seen in Sec. 7, a 
wide video means a video band pass that is of the order 
of the band width at the IF frequency which contains 
most of the noise power. 

Again, for cases where another nonlinear operation 
follows video (or audio) filtering, the characteristic 
function is of value and the approximations developed 
here can be used to advantage. As the video filter 
becomes more narrow, terms in the denominator of 











1050 M. A. 


Fy (¢1,f2;7) representing powers of (¢1,f2) greater than 
unity, assume more importance. It appears that powers 
of ¢ up to the third yield results which imply a first- 
order correction to the distribution for the infinitely 
wide post detection filter. 

As may be seen from the approximations developed 
for W,(x) in Sec. 6, more terms in the Taylor series 
approximation to the video filter function must be con- 
sidered as the video becomes narrower, and more eigen- 
values of significance are obtained. 

In the extreme of a narrow-video filter one would 
expect from physical considerations that the distribu- 
tion becomes normal. The results of the analytical work 
show to what extent the actual distribution departs 
from the normal for videos narrow compared to the 
IF band. Correction terms have coefficients whose mag- 
nitudes are of the order of powers of the ratio 


(— width=) 

IF width=2B ) 
APPENDIX I. MATRIX EQUATIONS 

In Eq. (2.8) the exponent , 





2 at! 
~ a — itu) nati 
kimi\laly 
is represented in matrix form by x’Bx, where x is a column vector, 
x’ the transpose, and B is a symmetrical matrix with components 
Bui= (a**/p| a! )—ite1; |a| is the determinant of @ and a*! the 
cofactor of a; in a. An orthogonal matrix can be found such that 
P’BP=AI, where A is diagonal (I=4,;). Letting Z= P-'x, x’Bx 
n 


becomes 2 \,Z;*, where the A; are the components of A = [A,4;« ]. 
im 


By a simple quadrature one obtains 


oy (f1,°° f= "la! TT Awt=y"|Ba!, (AL.1) 
imi 


since II \;=|A|=|B|. This gives (2.9) with a little further 
im 


manipulation, since ; 
B=ay"-if; aw'=[a!/\a|J=o". (A1.2) 
Now, by finding a symmetrical matrix Q, such that Q(A—iI1)Q* 
=A’I, where 
CA Je, j= Seber, 7 +454, j, (A1.3) 
inasmuch as ¢y(f1,- + -¢{n)=i"|A|-'=|I—iA’|-, we obtain the 
result (2.10a), viz., 


dv (t1y** tn) = Taj). (A1.4) 
j=l 


The equations for the rows of Q become the equations for the 
eigenvectors f;;)(A—iI)=2'f;;), or 


YZ Sipemfjk=j' fim; 
which is Eq. (2.10b). 


(A1.5) 


APPENDIX II. DISTRIBUTIONS FOR THE 
INFINITELY WIDE VIDEO FILTER 


When the video filter is infinitely wide we expect the general 
results of Sec. 4 to reduce to the more familiar distributions of the 
amplitude following square-law rectification alone. The latter 
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may, of course, be obtained directly from (2.10)-(2.12) for noise 
alone, or from (3.5a), (3.7) for a signal and noise, or, as was 
originally done in first-order cases, by a direct transformation of 
variables Vo=6*Vj,? according to the detector’s law, applied to 
the original normal distribution for the noise (see Sec. 2). Ac- 
cordingly, we now verify that our general results for arbitrary 
filtering, in the case of the first- and second-order densities, reduce 
to the expected forms when the video filter is allowed to become 
indefinitely wide.§§ The first-order densities for signal and noise 
follow at once. 


(I) First-Order Densities 


For an infinitely wide video, the filter’s weighting function i 
A (t)=6(t—0), and the solution to (4.17) is at once f(s)=Cp(s) 
and A;’=1. The normalization condition gives C={¢;-"?, whereupon 
(4.16) becomes ‘ 


(a) Narrow-Band Waves.— 
Fy (o1) sin! 0= san ($1) = (1—-itiy) 
Xexp(ligi Ml’ ?/(1—yei)].  (A2.1) 
The corresponding probability density is found with the help 
of Pair No. 650.0 [G. A. Campbell and R. M. Foster, Fourier 
Integrals for Practical Applications (D. Van Nostrand Company, 
Inc., New York, 1947)] to be 
Wi(V) syn] e=¥ exp(—(V+M")/p)Io(2MV4/y), 
V>0; =0; V<0O; (W=2(Vy?) (A2.2) 
which is a well-known result.5-'4 For noise alone one has 
WiA(V)\.=ye"¥, V>0; =0; V<0, (A2.3) 


which is a x? density with 2 degrees of freedom, as expected. 
[From (2.6a) we recall that since 6?=2 here, y=2(\Vy*), where 
(Vy?) is the mean-square noise output of the IF filter. ] 

(b) Broad-band Waves.— 

Here (A2.1) is appropriately modified, to give us 


Wi(V sev le= J expl (it +itis?)/(1—i2ys1)] 


m Ge-Fn =) 


= (2rVy)? exp[— (s?+V)/2y] cosh[V#s/y], (A2.4) 


V>0; =0; V<0, 
where we have again used transform pair No. 650.0. This result 
has also been obtained by Emerson" in a recent article. For noise 
alone we get a x? distribution with 1 degree of freedom: 
W1(V)w| o=(2rVy)-te-¥'¥, V>0; =0; V<0, (W=(Vy?)). 
(A2.5) 


(II) Second-Order Densities 


Here the calculations are more involved. From (4.18) we have 
for general post-detection filtering 


Fy (f1,¢237)s4w= I(1—idx)-* exp{ (if M (t' +1) 
k=1 


2 — 
x [1 () +524 2) Ifa) / av}, (4.18) 


where a=1 for narrow-band waves, and a=} for broad-band 
disturbances, with M replaced by S. The associated integral 


§§ The carrier or central-frequency zone in the narrow-band case is, 
however, not passed by this “‘infinitely’’ wide video. 

4S, A. Goudsmit, RL 43-21; J. C. Slater, RL V-23; D. O. North, RCA 
Tech. Rept. PTR6C; K. A. Norton and V. D. Landon, Proc. Inst. Radio 
Engrs. 30, 425 (1942). (RL =Radiation Laboratory, MIT.) 

1 R. C. Emerson, J. Appl. Phys. 24, 1168 (1953). (Here V =y Y, S?=yX, 
and our ¥ =(VwN?), since here 6?=1.) In general, for arbitrary 8 we must 
replace ¥ by 6%/2 =6%(VN2)/2 for narrow-band waves, and 6% =8%(V x?) 
for broad-band waves, in all results of the present paper. 








V 


~~ S&S fH = = hes 6D 


a> 


— ——_ ~~ a 


ma za a 2 





oise 
was 
n of 
1 to 


‘ary 
luce 
»me 
oise 


ni 
o(s) 
pon 


2.1) 
help 


irier 
any, 


2.2) 


2.3) 


‘ted. 
here 


\2.4) 


esult 
roise 


\2.5) 


have 


4.18) 


and 
ogral 


se is, 
RCA 
Radio 


=yX, 
must 
(Vv?) 





DISTRIBUTIONS OF SIGNALS AND NOISE 1051 


equation is 
Vf Cd W+h24 Ur) D0,s)f Odt= f(s) (A2.6) 


and since A (t)=6(t—0); p(0,s)=p(—s)=p(s); o(7, —s)=plr—s), 
etc., Eq. (A2.6) becomes 


$(s)f (0)+F20(7—5)f (7) =Arf™ (SW, (A2.7a) 
subject to the normalization conditions 
Ef OP+ Ef (2)=1. (A2.7b) 
In general, the integral equation (A2.6) is here satisfied by 
f®(s)=Ci™ (61,52) (S)+C2™ (61,52 )F20(s—7).  (A2.8) 


Substituting (A2.8) into (A2.7a) (or (A2.6)), rearranging terms, 
and setting the coefficients of p(s) and p(s—r) equal to zero, since 
the equations are then satisfied by arbitrary p(s), arbitrary s, r, 
we obtain 


c(t n)- C2 F2p(7)=0; 


—C,PEp(2)+C1(—F3) =0. (A2.9) 


For nontrivial values of C;, C2 we determine from this the eigen- 
values Ax, by setting the determinant of the coefficients Ci, C2 
equal to zero, obtaining two values 


Meh (hth lottit antl —p*)}4}; k=1,2 (A2.10) 


and hence four coefficients C,;“), C2“). From the two relations 
(A2.8) in (A2, 7a), and the two equations (A2.7b) for nor- 
malization we get the necessary number of expressions to deter- 
mine the coefficients, and hence the eigenfunctions. Unlike the 
case of noise alone, the eigenfunctions, as well as the eigenvalues 
Ax (k= 1, 2) are needed when there is a signal, as can be seen from 
(4.18) above. The desired probability density W2(V1,V2,7) sy! 
then follows by inversion, in the usual way. 

Since C;“), C,“) are, however, complicated functions of 1, 
and 2, the actual inversion is not easy. The expected result may 
be obtained, none the less, by a direct transformation of Ws, 
where Vour=6V in?= Rin?. For narrow-band cases, using a result 
of Middleton,'* we have at once 
(narrow-band): 


W2(Vi,V 37) spn | o= {L(1—p?) 7 exp[— (V+ V2) /W(1—p?)] 

Xexpl— (M2+M2—2M,Mp)/y(1—p*)]} 

+n , V.)3 - '_- 
_ 2p(V iV 2) Mn M, M2, Vi )In (ae avs), 

Mo ¥(1—p?) v(i—p) ¥(1— 9?) 

Vi, V2>0; Vi, V2<0 (A2.11) 

and where M,= M(t); M2= M (te). 

In the broad-band situation a direct transformation of (2.9), 


using (2.10), of reference (16) and the fact that Vout=6?V in? 
gives us immediately (6?=1): 


(broad-band): 


W2(V1,V 9537) syn | o= ({2rYLV iV 2(1—p?) 4} 
-exp{—((Vit-V2—2p(ViV2)#]/2y(1—p*)}) 
-exp{ —(S5:°+S2—2pS,S2—2(V1)#(Si—pS2) 
—2(V2)4(S2—pS1)]/2y(1—p*)}, 
Vi; V2>0; =0, Vi, V2<0. (A2.12) 
Where there is no signal, we do not need the eigenfunctions, 


only the eigenvalues (A2.10). The characteristic function (4.18) 
reduces now in the narrow-band case to 


Fy ($1,237) | o= 6(61,¢2)v = [1—WArde— tp (Ar FA2) J? 
=(1-W(it$2)—Poite(1—p2(7)) J. (A2.13) 





16D, Middleton, Quart. Appl. Math. 5, 445 (1948), Eq. (5.17). 


The second-order density is obtained from the Fourier transform 
of (A2.13); letting 1—p?=k, we can write this transform as 


20 —if2V2 0 
W2(Vi,V257)v| a= V(r)? f dz dgye~ iV 1 





» (1—ipkf2) J-« 
1—ifts 
x [+55 —itek :- (As. 


The second integral in (A2.14) has a pole in the lower plane for 
all real values of f2. For V:<0 the integral is zero, and since 
(A2.14) is symmetrical in ¢:V1, ¢2V2, it vanishes also for V2<0. 
For V:>0, we get for the integral in ¢, the value 

—2ni expl[—Vi(1—ipt2)/y(1— ipkee)). 
Letting u=kyt2+i=[(Vi/V2)(1+k)], we have 


te Vit VIRY  poti 
<< — J Lexp—i(ViV 2(1-+8)/ PY) 
X(n-9 n/n. (A2.15) 


This integral is recognized as a modified Bessel function of 
zeroth order, giving us finally the result 
W2(V1,V 257) | «= (¥(1—6?) * exp[— (Vit V2)/¥(1—")] 
2e(ViV2)"\ |. — 

KI ve) ), J “" J 2>0; =(), if 1,} 2<0 (A2.16) 
for the narrow-band case, which is immediately verified from 
(A2.11) on setting M,= M2=0. 

The broad-band case may be obtained in a similar way, where 
now the characteristic function is 
Fy (1,£237)v| @=(1— iW Sit o2)-Poase(1— pe?) 4. (A2.17) 
We have finally 
(broad-band): 
W2(V1,V 257) y| o= (2rY[V iV 2(1—p?) 3 

Xexp{—(VitV2—2e(ViV2)#]/2p(1—*)}, 

Vi, V2>0; =0,Vi1,V2<0, (A2.18) 
as expected, see (A2.12) when S;=52=0. 


J lox 
Vs 2rkY* 


APPENDIX III. EXPANSION OF THE CHARACTERISTIC 
FUNCTION F,({). TIME AND FREQUENCY 
REPRESENTATIONS 


As we have seen, a form of the characteristic function which 
is useful in the important limiting cases of wide or narrow video 
filters is given by the expansion of the reciprocal of the Fredholm 
determinant associated with the integral equation (4. 10b) in the 
general case, and with (4.13b) for n= 2. For the latter{the charac- 
teristic function i is A“, see (4.15). 

To represent the expansion of Fy(¢1,t2;7)w compactly in both 
the time and frequency domains, the following Fourier transforms 
are needed, relating system-, weighting-, correlation functions, and 
intensity spectra associated with the IF and video filters. We have 
for the IF filter 


o(r)= J H(Nei*"ds; o(0)=1 (A3.1) 
H(f)= f o(r)e~#%dr= |B( fe fo)|*, (A3.2) 
Slamata=1; fl Pids= fi e@drais. (433) 


For the video filter we write 


Y(iw)= f- AWe*'a; A= f= VGwe*'as; 
t<0, =0 (A3.4) 


\V(iw)|2= J" axe-*'a; =f” |¥Gu)|*e'af  (A3.5) 


=f" A(t')A ('t)dt’ =%,(+1), (A3.5a) 
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and 


2(r1,72)= f AW)AC + r)A (+12) 


= ff ¥ Ga)Y (ie!)¥ (ieo+iee!)* explo’ (11 12) — iors M fad /’ 


(A3.6) 


When the determinantal expansion is carried out through terms 
of order 3 in (¢1,¢2), the result is finally 


Fy Si,257)"= {1-ivi+4) 
Srey — D(0)) +25:$2(1— D(—1r))] 


+ EL Git teh + 3+ EM GP+E2)D20) +27) 
+2(¢2+£2)D;(0,0)+2¢:7¢2{2D3(— 7,0) +D;(7,7)} 
1 
+2h82(2D4(70)+D,(—7, —1) +--+}  (A3.7) 


where for broad-band noise { }~' is replaced by { }~}, as usual. 
The functions D2, D; are specifically 


Di(r)= f(t), (l+7)dt= rf H()H(S’) 


X | V (iw+iw’)!? expliWw+o’)r id fds’, (A3.8) 


Di(r,7)= ff (utr, v+7)p(u)p(v)p(u—v)dudv 


aiti ¥ (iw) ¥ (iw’ )V (iw +iw’ )*H (f— f”’) 


KAS +f" ed fd f'd ff". (A3.9): 





MEYER AND D. 
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Further relations of some use are 
2Ds(—1,0)+Da(r,7)=3 fff” ¥ (ie)¥ (iw )¥ (ieo+ie’)* 


XA(f— fH +f Af ed fa fds” (A3.10) 


and 2D;(7,0)+D;(—+7,—7) is precisely (A3.10) with w replaced 

by w’ in the exponent. The integral equation (4.13b) for n=2 is 

transformed in a similar way to 

vH(f) ff _ Sith: exp[i(w’—w)r]} F (iw— iw’ )g (f")df’ 
=Axg(f), (A3.11) 


where it is assumed that the f“(¢) possess a transform 


fo O= fe Neltas, Mf) =J™ sr Wertat.  (A3.12) 


APPENDIX IV. REMARK ON THE CALCULATION 
OF MOMENTS 


The moments of the distribution for W2(V1,V2;7) can be ob- 
tained easily from the Taylor series expansion for Fy (¢1,¢2;7), as 
is well known. That is, we can write, when these quantities exist, 


(ViEV 2!) = Age(—i tt /1 ik}, 


where the Ax; are the coefficients of the series expansion. One then 
uses the characteristic function expansion of Fy (¢i,{2;7) to obtain 
the corresponding coefficients in the Taylor’s series. 

One result of this calculation is 


iv) =e(14+ rf A(MA(P)| YES)? 


Xexp[— i(o+w'ri fal’), (A4.1) 


the correlation function of the output wave. 
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Diffraction of Electromagnetic Waves by an 
Aperture in a Large Screen 


L. LEWIN 


Standard Telecommunication Laboratories Limited, 
Enfield, Middlesex, England 


(Received April 5, 1954) 


N a letter in your February issue, J. H. Crysdale' deals with 

Bekefi’s* analysis of the field arising from a magnetic Hertzian 

vector II*. In attempting to satisfy boundary conditions with a 
single component II,*, the relation 


1,*=M,* (1) 


in the aperture was stated to be consistent with the boundary 
conditions on the magnetic fields derived from 


H,=@I1,*/dxdy 
A, =07I1,*/dy?+F11,*. (2) 


Obviously the equality of Il,* in the two regions ensures the 
equality of H, and H,. However, the fields calculated from (1) 
were not in agreement with experiment. 

Although consistent with (2), Eq. (1) is not the most general 
solution arising from (2). Continuity of H, leads to 


aI,*/ay+RI,* = 9 ;,*/ay+hM,* (3) 


Il,*=11;,*+A cosky+B sinky, (4) 


where A and B are independent of y. 

Continuity of Hz requires further that A and B be constant. 
Only if they are set equal to zero do we get Eq. (1). 

Using (4) instead of (1) the solution has been obtained to two 
problems which satisfy the condition imposed by the choice of a 
single component for II*, namely, the vanishing of E,. These 
are (a) the radiation of a rectangular wave-guide aperture in an 
infinite conducting screen* and (b) the transmission through a 
symmetrically situated rectangular window in a rectangular 
wave guide.‘ In both cases experimental results have been con- 
sistent with the theoretical predictions. 

In dealing with the expression in (4), no attempt was made to 
obtain A and B by fitting the solution to the boundary, but the 
form (3), in which A and B do not appear explicitly, was utilized 
directly. 

In any particular case it is necessary to confirm the absence of 
an E, component arising from the boundary conditions. Thus, 
since the tangential component of electric field must vanish at 
the aperture rim, the method will not work with a circular aper- 
ature; and in fact a rectangular opening is the only form for 
which the solution is appropriate. 

1J. H. Crysdale, J. Appl. Phys. 25, 270 (1954). 

2G. Bekefi, J. Appl. Phys. 24, 1123 (1953). 

3L. Lewin, Advanced Theory of Waveguides (Iliffe and Sons, London, 


1951), p. 121. 
4 See reference 3, p. 88. 





The Electron Atmosphere in Dielectrics 
M. J. MoRANT 
High Voltage Laboratory, Queen Mary College, London, England 
(Received January 28, 1954) 


N a recent paper, Skinner, Savage, and Rutzler,' have con- 
sidered the electron atmosphere in dielectrics bounded by plane 
metal electrodes. Their calculations proceed from earlier work by 
Mott and Gurney, in which the potential V(x) due to the space 
charge, of density p(x), in the dielectric is calculated from the 
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Poisson and Boltzmann equations. 


4xp 
vVV= x (1) 
e(V— Vo) 
pP=poexp—— (2) 


where K is the dielectric constant, and pp and Vo=0 are fixed by 
the metal-dielectric boundary (x=0), where 


aiid 2d emt exp a se- (3) 


Mott and Gurney show that 
V= "fo (28 a) e+}, (4) 


for a single metal in contact with an iil dielectric. This was 
compared by von Laue’ with the arrangement of two electrodes a 
finite distance apart, for which 


as 2k ry A 
V= , log, cos K-RT: %. (5) 
In this x=0, V= Vo at a point mid-way between the electrodes, 
and the corresponding charge density po’ has been given in terms 
of po of Eq. (3) by Fowler.‘ It was also shown by Fowler that (5) 
approximates to (4) for sufficiently small charge densities. 
Skinner, Savage, and Rutzler use a generalization, 











= a log.(ax+b) (6) 


of Eq. (4) to allow for different boundary conditions, but this form 
of solution is not permissible because of the lowering of the work 
function in the presence of a dielectric and the larger mean charge 
densities involved. This can be seen from Fig. 1 which gives a 
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Fic. 1. Potential in a bonded dielectric. 


comparison of (5) and (6) for a typical case quoted by them. The 
correct solution removes the anomaly of a discontinuity in 
dV /dx at the midpoint which is shown in many of their curves. 
1. Also there seems to be some confusion by these authors in 
their use of electron energy level diagrams. For more than one 
dielectric it is necessary to distinguish between the potential 
energy of an electron in the dielectric and the potential as given 
by the solution of the electrostatic problem. The energy of an 
electron at the bottom of the conduction band (depth x) of an 


insulating crystal, and at a point of electrostatic potential V is. 


—(eV+x). Thus the boundary conditions for a pair of electrodes 
of the same metal at a spacing D, are 
V(0)=0, V(D)=0, (7) 
and not 
V0)=—m, V(D)=—x:2. (8) 


1053 











1054 LEerzrers TO 
For different metals of work functions ¢; and ¢2 if V(0)=0, then 


V(D)="(6.-4), ) 


i.e., equal to the contact potential between the metals. This seems 
to have been ignored in reference 1, Fig. 3b of which shows a 
difference of Fermi levels of the metals in equilibrium with the 
dielectric. The equality of the Fermi levels of all metals in thermal 
and electrical equilibrium is independent of the presence of a 
dielectric between them, and is brought about by the contact 
PD. This also follows from Eq. (5) when correct boundary condi- 
tions are used. 

At the contact of two insulating crystals, the equations given 
in reference 1 lead to a change in the level of the conduction band 
of 

di—G2tkT 1og,(*") 

Ke 

(not 2kT log.(Ki/K2) if the derivation is followed through 
completely). This arises from the use of the wrong Eqs. (6) and 
(8). The complete solution for the two-electrode, two-dielectric 
case following Fowler, leads to a different, more complex, equation 
equation for V. It can be seen, however, that V should be con- 
continuous at the dielectric-dielectric interface so that the 
potential energy jump at the bottom of the conduction band is 
just (*%,;—x2). Using these facts it is possible to show diagram- 
matically, Fig. 2, the energy levels across the whole space. 


DIELECTRIC 
2 





METAL | METAL 2 


Fic. 2. Electron energy levels for two dielectrics in equilibrium 
with the electrodes. 


2. Skinner, Savage, and Rutzler consider the equilibrium of a 
dielectric bonded to one electrode and with a small air gap to the 
second electrode. Figure 2 shows that the form of the potential 
variation will then be different from the doubly bonded case for 
all values of x if the ajr gap is sufficiently thick for tunnelling to 
be unimportant. However, the total potential between the 
electrodes will remain at the contact potential difference and will 
not approach infinity as indicated by p/po~0. This affects the 
electrostatic contribution to adhesion more than the other errors. 

3. As for frictional electrification, the measured charges may 
not be entirely explained by this process, because of low values, 
ef the order of 0.1 ev, deduced for (6—-x). For such values, the 
charges measured should be extremely sensitive to variation of 
metal. The only theoretical estimates of x (those of Mott and 
Gurney’) give approximately 0.5 ev, so that (@—~x) should not be 
less than 2 ev. 

4. A further point that may easily be overlooked in considering 
dielectric space charges is the effect of the Fermi level of the 
crystal. Mott and Gurney’s calculation was for an ideal crystal 
(i.e., an insulator because of lack of conduction electrons), so that 
the distance Er of the Fermi level below the conduction band 
was very much greater than the increase, within a reasonable 
distance from the electrode, of ev due to the space charge. Thus 
the naturally occurring conduction electrons could be completely 
neglected. Applying the same model to an amorphous insulator, 
Er may be less, although still in the center of the forbidden range 
of energies, since the insulating property can be due to low 
electron mobility rather than lack of conduction electrons. This 
limits the potential rise, due to the space charge, to a value Er/e 
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so that the bulk metal and dielectric have their Fermi levels at 
the same height. The space charge is set up at the metal-dielectric 
contact to bring about this equilibrium condition: If Er is low 
the whole space charge may be concentrated near the electrodes, 
so that calculations based on an ideal crystal may be misleading. 
Similar considerations have to apply to the space charge at the 
contact of two dielectrics. 

As pointed out by Skinner, Savage, and Rutzler, electron traps 
which must strictly be included for an amorphous solid would 
modify the model considerably and lower the numerical value of 
Er even more. The voltage limitation considered above would be 
much more pronounced in such cases. 

' Skinner, Savage, and Rutzler, J. Appl. Phys. 24, 438 (1953). 

2N. F. Mott and R. W. Gurney, Electronic Processes in Ionic Crystals 
(Oxford University Press, London, 1948), second edition. 

3M. von Laue, Handbuch der Radiologie, 6, (Leipzig, 1925), p. 460. 


_*R. iH. Fowler, Statistical Mechanics (Cambridge University Press, 
Cambridge, 1936), second edition, p. 367. 





Electrical Phenomena in Adhesion. I. Electron 
Atmospheres in Dielectrics 


D. C. WEsT 
Nova Scotia Research Foundation, Halifax, Canada 
(Received February 25, 1954) 


HE recent article by Skinner, Savage, and Rutzler' under 

the above title appears to be a very valuable and thought- 

provoking study. However, there are one or two points which 
seem to have been overlooked. 

It has been assumed implicitly that there is no formation of 
surface charge at the junction of two insulators, as indicated by 
the boundary conditions imposed by Eq. (2) of the paper. The 
more general boundary conditions are 


lim [KiF (d—«) —K2F (d+) ]= —4ro, (1) 
€—0 


where K, and Kz are the dielectric constants of the two media 
which meet at x=d, F(x) is the electric field, and o is the surface 
charge density at the boundary. Since there is an electron atmos- 
phere in both media, it would be well to consider the pos- 
sibility of an accumulation of charge at the surface of dis- 
continuity. It is known, for example, that relatively large amounts 
of “static” electricity can be transferred from one dielectric to 
another by simple contact. 

An application of the same general equation to the situation 
at a, where the field reverses sign within one of the dielectrics, 
shows that 

lim [F (a+¢) —F (a—e) ]=490/K. (2) 
€—0 


Comparison with Eq. (3) of the paper shows that if there is no 
discontinuous charged surface at a(i.e., c=0), the factor g must 
be equal to —1. The case g= +1, treated in the paper, can occur 
only if there exists a discontinuous layer of charge at a with 
density c= KF (a+e)/2x. Although mathematically correct, their 
results, therefore, apply to a somewhat unreasonable physical 
situation. 

Fortunately it is quite easy to find another solution which does 
not require the presence of a thin sheet of charge in the center 
of a solid dielectric. Following the derivation given in footnote 
10 on page 442 of the paper, we have from diffusion theory and 
Einstein’s relation 


NeF =kTdN/dx. 
From the electrostatic field equations we also have 
KdF /dx=4rNe. 
Elimination of N from these two equations yields 
kT@?F /dx?—eF dF /dx=0, (3) 
which on integration gives 
2kTdF /dx—eF*=constant. (4) 
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There are now three cases, depending on whether the constant of 
integration is (a) zero, (b) positive, or (c) negative. 

(a) The choice of zero gives the solution of Mott and Gurney? 
used in Skinner, Savage, and Rutzler’s paper. It is most suitable 
for the case of a semi-infinite dielectric in zero external field. 

(b) Setting the constant equal to 4k®7?/e%m?, the solution 
becomes 


F=— (2kT m/e) cotm(x+20), (5.1) 


where xo has the same significance as in the paper, and m is an 
additional parameter. The space charge N and potential energy 
eV become 


N= (KkTm?/27e?) cosec?m(x+xo) (5.2) 
eV =2kT [log sinm (x+x0) —log sinmxy ]+eV 0. (5.3) 


It will be observed that the solution is valid only in a region 
—xo<x<—2xo+7/m and that near these limits the space charge, 
field and potential all tend to become infinite. This corresponds 
to the case of a dielectric slab of finite thickness between two 
sources of charge. x» depends on the charge density No at x=0 
as in the paper, and we may also introduce a corresponding 
constant x; connected with the charge density NV; at the junction of 
the dielectric with the second metal at x= D. The exact relation- 
ships are 


mx;=cosec!(e/m)/ (24N;i/KkT) (6) 
where i=0 or 1. The potential difference across the slab is given by 
Vi-—Vo=(kT/e) logNi/No. (7) 


The additional parameter m is connected with the dielectric 
thickness D by the relation 


m(D+x29+4x) =x. (8) 


All the functions are continuous within the dielectric as required 
for a reasonable physical theory. They tend toward the form 
given in the paper as D— =. 

(c) In a similar way the choice of a negative integration 
constant leads to a third solution, corresponding to the case of a 
semi-infinite dielectric subjected to an external field. At large 
distances this field reduces to the value F..=2kTm/e. In this case 
m becomes a parameter characterizing the field strength. This 
solution may possibly find application to the study of dielectric 
absorption in charged condensers. 

1 Skinner, Savage, and Rutzler, J. Appl. Phys. 24, 438 (1953). 


_? Mott and Gurney, Electronic Processes in lonic Crystals (Oxford 
University Press, London, 1940), p. 172. 





Electrical Phenomena in Adhesion : Further 
Comments on Electron Atmospheres*t 


SELBY M. SKINNER, ROBERT L. SAVAGE, AND JOHN E. RUTZLER, JR. 


Department of Chemistry and Chemical Engineering, 
Case Institute of Technology, Cleveland, Ohio 


(Received May 10, 1954) 


S Dr. West says, the diffusion equation has 3 solutions;' 
these were considered in preparing the paper. In their 
application to the general single dielectric, and even more so, to 
two dielectrics between the metals, the “finite’”’ solutions become 
excessively complicated.? Since the electrostatic contribution to 
adhesion is only appreciable with boundary charge densities of 
such magnitude that the infinite solution and the finite solution 
are essentially the same, the former was used as an approximation. 
Physically this means that the charge distribution characteristic 
of the infinite case has been chopped off at negligible charge, 
i.e., at a distance corresponding to an appropriate thickness of 
dielectric, and a similar distribution placed on the other end. 
This chopping off is responsible for the discontinuity in slope of 
the potential and charge density. 
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For adhesive thicknesses encountered in practice (>10~ cm), 
and electron densities (>10” electron cm~) at which the electro- 
static contribution need be considered, the effect on the adhesive 
force is in error by 0.001 percent or less, and the distributions 
within the adhesive are affected by a similar negilgible fraction 
of the experimental uncertainties in adhesion measurements.* 
However, the discontinuous charge distribution at x=a was 
overlooked in determining the total charge within the dielectric. 
Its effect is to change the expressions for the surface charge 
density on the metal,‘ which should read Noexo, and to cause the 
adhesive contribution to be NokT, both being irrespective of the 
thickness of the dielectric either before or after break. This 
requires obvious alterations to Eqs. (6) to (8.3) inclusive, and 
minor alterations in the discussion of the work of Meissner and 
Merrill, in the second and third paragraph of V, 2, and in section 
VI, 1, where the words “repulsion” and “reversal of charge” 
should be replaced by “enormously reduced attraction’® and 
“greatly reduced charge,” respectively. In section VI, 1, the 
discussion on the dependence of the force on the sample thickness 
is inapplicable. 

2. As Mr. Morant says, the factor 2kT InK,/K: in the potential 
jump between the dielectrics (at x=d) should be divided by two 
if our computation is followed through. However, his discussion 
(sections 2, 3, and 5) of voltages, Fermi levels, etc., does not take 
into account the full thermodynamic picture, and confuses Fermi 
levels with inner electrostatic potentials. Either the Volta potential 
or the inner potential can be used, but whichever one is chosen 
must be followed through consistently. For the Volta potential, 
the boundary conditions are (Fig. 1) 0:(0) =0, |U2(D)| = @.—¢: 
(not zero), and the usual electrostatic conditions on potential and 
field at the dielectric-dielectric boundary. If the inner potential is 
used, the difference at the latter boundary is | x2—x:]| if the three 
strictly accurate sets of transcendental equations are used. But 
then it is necessary to consider the thermodynamics of the 
metal-dielectric boundary : 

(a) With no external applied potential, the Fermi level (being 
the free energy per particle) equalizes throughout both metals 
and dielectric. The Volta potential difference in the metals raises 
the energy of the electron vapor outside one metal with respect to 
the other; the external electron densities are related by a Boltz- 
mann factor containing this energy difference, namely :* 


leVO| = g2—¢1; N20/Nio=exp[— (¢2— ¢1)/kT]; (1) 
Njo=N. expl— ¢;/kT]; Ne=2(2emkTh)! ? 


(b) In a nonmetal, when a space charge exists, n=n; 
Xexp[(Er—5)/kT, where ¢ is the intrinsic Fermi level, and Er 
the quasi-Fermi level (imref) ; therefore, 


U=t—kT Inni+c=Er—kT Inn+c, (2) 


in which ‘VU is the electronic charge times the inner electrostatic 
potential, and c is the same for all substances in thermodynamic 
equilibrium. If there is no space charge, a rise in the Fermi level 
means an increased electrostatic potential. With space charge, 
the constant height of the Fermi level means that the concentra- 
tion m of electrons toward the interior decreases just enough to 
make up for the increase of the inner electrostatic potential due to 
space charge. 

Since the electron densities are different at the two boundaries, 
the inner electrostatic potentials will be different there. By 
generalization of Eqs. (3) to the atmosphere in a dielectric and 
use of Eq. (2) and the value c=kT InN,, we have 


V1 (0) = Ert+ (¢:—x1); U2(D) = Er+ (¢2—x2). (3) 


Equation (3) or its equivalent obtained by adding the same 
constant to both potentials is the correct boundary condition on 
the inner potentials, and ¢ cannot make both potentials zero. 
Equations (3) and the requirement that the potential be 
continuous and have a maximum in one dielectric must be 
satisfied. With the “finite” equations, there then are specified 
the potential jump at x=d, and the relation of the electrical 
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fields. With the approximate solution the maximum is no longer 
smooth; also it is necessary to relax one condition. A double 
layer or a charge sheet must be permitted at x=d; the latter 
introduces considerable error, the former negligible error, so it is 
chosen. Using these boundary conditions, the expressions in the 
paper are obtained, except that the right-hand side of (3) above 
appears in the constants in the potential expressions; the space 
dependence of the inner potentials, the fields, and charge densities, 
and the computed adhesive forces are those of the paper. The 
energy gap is kT InK,/Kz independent of either ¢ or x. The 
deisred quantities, namely charge transferred and adhesive 
forces, are accurate to within a fraction of experimental error. 
The difference from the solution in the paper does not affect 
the adhesion to the metal, but takes account of a transfer of 
electrons between the dielectrics themselves; this equalizes Fermi 
levels in them and produces an electrostatic attraction between 
them if they are solids. 

3. See Morant’s Sec. 5: By the expressions, the maximum inner 
potential difference (with respect to the interface) attainable 
within even a 2-cm thick layer of adhesive at room temperature 
and minimum xo is 0.75 v and reasonable values are 0.1-0.7 v. 
The energy gap in insulators is from 2 to 7 volts. However a more 
fundamental point is involved: the charge density decreases as 
the inner potential increases, and the Fermi level remains constant ; 
in the general nonmetal, it is known that electrons are raised 
thermally to conduction levels only if the Fermi level moves up. 
Field emission requires much higher (externally applied) fields 
and a decreasing rather than increasing inner potential. Com- 
mercial insulators and plastics are insulators precisely because 
precaution has been taken to avoid impurity levels in the intrinsic 
gap. Thermal contributions are not therefore a limitation on 
the inner electrostatic potential in insulators, though they may be 
in semiconductors. 

4. With respect also to Morant’s Sec. 5, mathematically it is 
possible to assume insulator resistance to be due to small charge 
concentration or small mobility. It is quite possible that the 
mobility is smaller than in semiconductors, but to attribute the 
small conductivity entirely or even in major part to smaller 
mobility raises conceptual difficulties. (1) Astronomical periods are 
required to reach equilibrium. (2) Even amorphous substances 


have molecular structure. Since polystyrene, for example, has - 


10-"* the conductivity of germanium, attributing the insulating 
value to poor mobility requires assuming an electron mean free 
path 10~* that in germanium, i.e., less than 10 cm. Such a 
value has little physical significance ; even reasonable modifications 
of assumed electron temperature do not correct the difficulty. 
Electron concentration in the conduction level, however, depends 
exponentially on the intrinsic energy gap; a gap of 0.7 to 2 ev 
would be sufficient to explain, on the basis of charge concentration, 
observed insulating properties. Kimball’ has shown that in 
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diamond the gap may be as great as 7 ev. The question can, of 
course, only be settled by measuring both m and b by experimental 
methods suitable to insulating materials. 

5. Morant’s Sec. 3 appears to refer to the legend on Fig. 5(b) (2) 
of (I). This is the charge density at x=d not at the metal; at the 
metal, m goes to No not 0, though at d it reaches a fraction of No. 
We agree with him that the present treatment does not explain 
frictional electrification, and did not make such claim. 

6. The intention of the article was to point out that the transfer 
of electrons from a metal into a dielectric adhesive could bring 
about a significant electrostatic contribution to the measured 
adhesion. Such transfer does not depend upon the properties of 
the valence bonds or energy levels of the dielectric alone (which 
determine the intrinsic energy gap), but upon the energy gap at 
contact, i.e., between the Fermi level of the metal and the conduc- 
tion level of the adhesive; therefore the force cannot be predicted 
from the measured chemical properties of the adhesive alone. 
It is our understanding that in at least two commercial research 
laboratories the subject is now being investigated in detail; the 
results of such investigations together with ours should yield 
useful data on contacts between adhesives or similar materials 
and metals, and help to determine the range of applicability of 
the present considerations. 


* With the support of the Aeronautical Research Laboratory, Wright 
Air Development Center, U. S. Air Force. 

t Since much of what follows was in manuscript when the preceding 
letters were received, their acknowledgment was to be incorporated 
therein. A delay in the second article makes it desirable to treat the points 
raised, directly. For lack of space much of the contents of correspondence 
to Dr. West and Mr. Morant is omitted and will be included elsewhere. 
The first article by the present authors is designated by (I). 

! Treated by v. Laue (as mentioned above); Richardson (references in 
Fowler, Statistical Mechanics); Fowler himself; Mott and Gurney, Elec- 
tronic Processes in Ionic Crystals (Oxford University Press, London, 1948) 
second edition; and F. Borgnis, Z. Physik 100, 117 (1936). 

2S. M. Skinner, J. Appl. Phys. (to be published). 

3 This is discussed more fully in the second article on the contributiont 
of electron{atmospheres to adhesion, Skinner, Savage, fandfRutzler, which 
will be submitted in the near future to J. Appl. Phys. 

‘The additional surface charge density at the metal due to the dis- 
continuous charge sheet is at most 4 percent and usually of the order of 
10-3 or less. 

5 The electrostatic component of adhesion at room temperature between 
a metal surface coming up in air to one side of a thin dielectric film already 
adhering on its other side to a metal surface will be far less than one micro- 
dyne cm~*, whereas the electrostatic component of the adhesion of the 
film to the first metal could be 600-1000 Ib in.~. 

*See R. H. Fowler and E. A. Guggenheim, Statistical Thermodynamics 
(Macmillan Company, New York, 1939), p. 486. 

7G. E. Kimball, J. Chem. Phys. 3, 365 (1935). 





Direct-Current Transients in Polymethyl 
Methacrylate and in Polystyrene 
PAUL EHRLICH* 


Diamond Ordnance Fuze Laboratories,t Washington 25, D. C. 
(Received February 17, 1954) 


UANTITATIVE measurements of the dc transients in 
materials with dc conductivities of the order of 107 
mho/cm and less have not been reported previously. This com- 
munication reports on such measurements made with a commercial 
Vibrating Reed Electrometer' on two widely used high polymer 
insulating materials. The procedure used was that of plotting on 
an automatic recorder the charge accumulated on the internal 
capacitor of the instrument. The voltage across this capacitor 
was 1 volt maximum which was only a small fraction of the battery 
voltage. The effect of the measuring instrument on the normal 
current flow through the specimen was therefore negligible. The 
samples were molded disks 7g in. thick with an effective electrode 
diameter of approximately 3 in. Data are presented for two 
different brands of commercial polymethyl methacrylate, a 
sample of polystyrene vacuum-polymerized at the National 
Bureau of Standards, as well as a commercial sample of the same 
polymer. Figure 1 is an illustration of the circuit used. 
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Fic. 1. Circuit used in conjunction with Vibrating Reed Electrometer 
(V.R.E.) for the measurement of very low dc conductivities. 


Figures 2 and 3 present data of charging and of discharge 
currents in polymethyl methacrylate and in polystyrene as a 
function of time. It is seen that in all cases measurable absorption 
currents continue to persist over the entire time scale and that, 
even at the longest times, it is impossible to report steady-state 
conductivities. In one of the polystyrene samples the steady-state 
conductivity must be only a small fraction of the smallest trans- 
ient conductivity observed, as can be seen from the identity of 
charging and discharge curves over the entire time scale.? In the 
other two cases, the charging curves bend upward and away from 
the discharge curves at the upper end of the time scale, indicating 
that the steady-state conductivities have become a substantial 
fraction of the total conductivities. There is a significant difference 
in the data for the two samples of polystyrene; data obtained for 
a different commercial brand of polymethyl methacrylate, 
however, gave, within experimental error, identical results with 
the ones presented, except that the charging curves did not bend 
away from the discharge curves at long times, indicating the 
absence of any substantial steady-state conductivity. 

It is of interest to examine the connection between the measure- 
ments reported and the ac loss factor. It is well known that a 
relation exists between the frequency-dependent dielectric 
constant e’(w) the loss factor e’’(w) and the dc transient current, 
i(t).3 Von Schweidler‘ has calculated apparent capacitances and 
resistances for a decay function of the form i(#)= At and ¢’(w), 
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Fic. 2. Logarithm of the dc conductivity of a sample of polymethyl 
methacrylate plotted as a function of the logarithm of the time after 
application (O,(J) and removal (A,V) of the potential. The second 
— each case refers to a run made under reversal of the battery 
terminals. 
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Fic. 3. Logarithm of the dc conductivity of a commercial sample of 
polystyrene plotted as a function of the logarithm of the time after applica- 
tion (O,() and removal (A,V7) of the potential. The second symbol in 


each case refers to a run made under reversal of the battery terminals. 
The filled in symbols refer to a sample of vacuum-polymerized polystyrene. 


as well as «’’(w), can be immediately obtained from this develop- 
ment with the following result for the loss factor :§ 


€” (w) = 1/CalGo/w+8Cw" "I (1—mn) cos(nx/2)], (1) 


where C, is the capacitance of a sample of air which has replaced 
the dielectric, Co is the capacitance of the dielectric at very high 
frequencies, Go is the steady-state dc conductivity, and A, 8, 
and m are constants. It is readily seen that the loss factor may 
increase, decrease, or remain constant with frequency, depending 
on the exponent m of the decay function. By a suitable variation 
of the exponent through the time scale, the loss factor may be 
made to assume any of the shapes ordinarily observed. Hamon 
has shown that, subject to certain requirements almost invariably 
satisfied, Von Schweidler’s method can be made the basis of an 
approximation which is valid even where the exponent is not 
constant over the entire time scale. Hamon obtains 


e’i(0.1/f)/2fC.V, (2) 
which may be written as 
e’~G(0.1/f)/2xfeo, (3) 


where i(0.1/f) and G(0.1/f) are transient current and transient 
conductivity, respectively, at a time equal to one tenth of the 
recipricol cps frequency, V is the applied voltage and ¢ is the 
permittivity of free space (8.854 10~" farads/cm). 

Table I presents values of the loss factor for polymethyl metha- 
crylate and polystyrene calculated from the transient measure- 
ments by Eq. (3), and values at higher frequencies obtained 


TABLE I. Loss factor of polymethyl methacrylate and polystyrene. 











Polystyrene Polystyrene 
Polymethyl Vacuum- commercial 
methacrylate polymerized sample 
Fre- ‘ Fre- Fre- 
quency Loss quency Loss quency Loss 
(cps) factor (cps) factor (cps) factor 
10-3 8.1 X1073 10-3 3.2 X10-* 10-3 11. K10-4 
6.7 X10-3 9.3 X10-3 1072 2.0 X10-* 107? 3.6 X10~* 
102 2.1 X107! 10? 2.5 X10-4 10? 2.8 X10~* 
1066 1.4X107 108 2.5 X10-* 108 2.8 X10-4 
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directly on a Schering bridge. Barring a sharp dispersion region 
between 10? and 10~* cps, polystyrene would therefore appear to 
undergo no very decided change in loss factor at room temperature 
over most of the interval considered. Polymethy! methacrylate, 
on the other hand, has to undergo roughly a 20-fold drop in loss 
factor in a little over four frequency decades. Judging from the 
behavior of this polymer at other temperatures and frequencies,® 
this value would seem quite high; nevertheless, polymethyl! meth- 
acrylate has been shown to have a 100 cps loss factor maximum 
at about room temperature® and a marked drop in loss factor on 
the low-frequency side of this maximum should, therefore, be 
expected. 

The author gratefully acknowledges helpful advice given to 
him by Dr. A. H. Scott and Mr. N. Doctor in setting up the 
measuring circuit. He is also indebted to Dr. L. A. Wall for provid- 
ng the vacuum-polymerized polystyrene. 

* Present Address: Department of Chemistry, Harvard University, Cam- 
bridge 38, Massachusetts. 

t Formerly a part of the U. S. National Bureau of Standards. 

! Palevsky, Swank, and Grenchik, Rev. Sci. Instr. 18, 298 (1947). 

2A de conductivity of 10- mho/cm for a very pure polystyrene 45 
minutes after application of the potential has just been reported in another 
investigation. Warnet, Muller, Nordlin, J. Appl. Phys. 25, 131 (1954). 

3K. S. Cole and R. H. Cole, J. Chem. Phys. 10, 98 (1942). 

4 E. Von Schweidler, Ann. phys. 24, 711 (1907). 


+ B. V. Hamon, Proc. Inst. Elec. Engrs. 99, Part IV, 151 (1952). 
6D. J. Mead and R. M. Fuoss, J. Am. Chem. Soc. 64, 2389 (1942). 





Optimal Loading of a Thermoelectric Generator 
JAMES F, PERKINS 
Convair, Fort Worth, Texas 
(Received March 3, 1954) 


ELKES' has derived an equation for the over-all efficiency of 
a thermoelectric generator, for which it was assumed that 
the maximum over-all efficiency would be obtained by matching 
the load resistance R, to the generator internal resistance R. Papet? 
has pointed out that, while for a given generator the maximum 
power output can be achieved by matching impedances, the 
optimal operating condition is one corresponding to a maximum 
over-all efficiency and that this can be obtained by using a load 
resistance, R,, somewhat greater than the internal resistance R 
of the tenerator. Following Papet’s suggestion, Greefi* has 
calculated efficiencies for various loading conditions. Some of the 
calculated efficiencies are in error by more than a factor of two, 
however, as a result of neglecting the Peltier effect. For instance, if 
one analyzes generator II according to the reasoning applied 
to cases III, IV, and V; one concludes that, since generator II is 
only 1/25 as large as generator I, it will absorb only 1/25 as much 
heat at the hot junction. Since the outputs of I and II are the 
same as we have: Efficiency of II=25X Efficiency of I= 25X 2.34 
percent = 58.8 percent. By comparison, the actual efficiency of 
II was calculated by use of Telkes’ formula to be 20.9 percent. 
If one generalizes Telkes’ equation to include the value of the 
load resistance, one obtains 


1 
R+R, ‘ 
A(FERS) 4 B(R+Ro*/(RR) 





Efficiency = 


where 
A=1/(Carnot efficiency) = T7”7/(Tu—Tc). 
Be 4.92 10-§(Tx+Tc) 
2(Tu—Tc) 
The condition for maximum efficiency, which is easily obtained by 
differentiating with respect to R,/R, is 


(R./R) optimal = (A +B)*/B, 
In the limiting case of A/B-0, i.e., the case of vanishing 


efficiency, the Peltier term does not contribute to the heat 
transfer. Hence, maximum efficiency corresponds to maximum 
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output, and thus is obtained by impedance matching, R,=R. In 
Greeff’s Case I, for example, we find A/B=1.546/10.3=0.15, 
and (R./R) optimal = 1.075. 

The limiting case A/B—~ corresponds to a generator whose 
thermal partial efficiency, as defined by Papet, is independent of 
R,. Over-all efficiency is thus proportional to electrical efficiency 
ne= R./(R+R,) and is maximized by choosing R, much larger than 
R, i.e., (R./R) optimal © : 

In Greeff’s case II we find A/B=1.546/0.412=3.74, R, optimal 
= 2.18R=4.36 ohms, Efficiencyoptimal = 24.0 percent as compared 
to Efficiency r, = R= 20.9 percent. 

At still higher efficiencies the improvement in efficiency obtained 
by proper loading becomes more significant. For a generator with 
e= 1000u1'/°C, operated at the temperature difference of 500°C 
considered by Greeff, we have Efficiency r,=r=28.6 percent, 
Efficiency optimal = 39.0 percent, (R./R) optimal = 4.0. 

1M. Telkes, J. Appl. Phys. 18, 1116 (1947). 


2 R. M. Papet, J. Appl. Phys. 19, 1180 (1948). 
*M. B. Greeff, J. Appl. Phys. 21, 943 (1950). 





Power Output of Thermoelectric Generators 


Maria TELKES 
Research Division, College of Engineering, New York University 
New York, N. Y. _ 
(Received May 25, 1954) 


HERMOELECTRIC generators—theoretically—may offer 

numerous advantages, when compared with conventional 
electric power generators. There are no moving parts, nothing to 
wear out, if thermal deterioration at the hot junction could be 
avoided. The efficiency of the modern power generating stations 
is around 30 percent. The writer’s calculations! indicated that such 
efficiencies may be approached, if thermocouple materials are 
available giving a thermoelectric power of 1000 microvolt/°C 
(one millivolt per °C), operating at a temperature difference of 
500°C, provided that the thermocouple materials have “normal” 
Wiedemann-Franz-Lorenz relation. This relation is expressed by 


k/o=k-p=2.45 10-°7, 


where k=specific heat conductivity, p=resistivity, «=specific 
electric conductivity, and 7=absolute temperature. When the 
thermocouple is operated in a temperature range with 7), hot 
junction and 7, cold junction temperature, the corresponding 
k, o, or p values should be used. 

The writer has constructed numerous experimental thermo- 
electric generators, operated at various temperature ranges. 
The results have been summarized! and another article on “Solar 
Thermoelectric Generators” will appear in the near future? 
The highest efficiency obtained by the writer was around seven 
percent, using thermoelectric materials of 300 to 400 microvolt /°C 
with moderate deviations from the normal Wiedemann-Franz- 
Lorenz relation. In addition many materials have been used, some 
of them with thermoelectric power values in excess of 1000 
microvolt/°C (but unfavorable k-p values), but it was impossible 
to obtain higher efficiencies, except the values predicted by the 
theoretical calculation. . 

Papet*® and Greeff* published additional calculations indicating 
that the efficiency of thermoelectric generators could be increased 
by structural changes and by variations in the external load 
resistance, R, as compared with the internal resistance R;. 
Anyone who has constructed thermoelectric generators would 
naturally wish to obtain the maximum power output and for 
this reason would change the R./R; ratio in the widest possible 
range. The writer has naturally done this, and found that the 
maximum power output could be obtained when R,/R; was 1, 
or between 1.05 and 0.95. 

Perkins,® in a recent letter to the editor, calculated that the 
R./R;i ratio change may increase the theoretically calculated 
efficiency of thermoelectric generators, provided that thermo- 
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electric materials are available with 1000 microvolt/°C thermo- 
electric power, and normal Wiedemann-Franz-Lorenz relation. 
Operating at a temperature difference of 500°C, the calculation 
indicated that the efficiency could be increased from some 29 
percent to 39 percent, in the above case, by using an R,/R; ratio 
of four. In the writer’s original article! it was stated that the 
calculations should be revised, for high thermoelectric power 
values, because the Thomson effect was neglected (but not the 
Peltier effect). 

Unfortunately all materials with high thermoelectric power 
values (500 to 1000 microvolt/°C) deviate considerably from the 
normal Wiedemann-Franz-Lorenz relation and it appears that 
an efficiency greater than seven percent has not been attained. 
Calculations alone will not increase the efficiency; it is essential 
that better thermoelectric materials should be developed, which 
have the desirable 1000 microvolt/°C, at a normal Wiedemann- 
Franz-Lorenz relation. For practical purposes operating with 
T,=800°K, T-=300°K, k-p=1310-* (as an average for the 
operating temperature range). If the specific heat conductivity & 
were as low as that of compact insulators k=0.01 (in watt-cm™ 
deg), the corresponding p should not be higher than 0.0013 
ohm-cm as an average in the operating temperature range. 
Materials of this type have not been found as yet. 

1M. Telkes, J. Appl. Phys. 18, 1116 (1947). 

2M. Telkes, J. Appl. Phys. 25, 765 (1954). 

3R. M. Papet, J. Appl. Phys. 19, 1180 (1948). 


*M. B. Greeff, J. Appl. Phys. 21, 943 (1950). 
5 J. F. Perkins, J. Appl. Phys. 25, 1058 (1954). 





Thermal Diffusivity of Metals at High 
Temperatures 


D. ROSENTHAL AND N. E. FRIEDMANN 


Department of Engineering, University of California, 
Los Angeles, California 


(Received February 27, 1954) 


NDER the above title an experimental procedure based on 

the modified Angstrom! method has been described recently.? 

It has been known since the inception of the original method by 
Angstrom (1861) that an application of a periodically varying tem- 
perature at one end of a long bar offers the following advantages: 


(1) The temperature of the source need not be known. 

(2) Radiation and convection losses need not be determined. 

(3) The value of thermal diffusivity can be obtained within a 
temperature range sufficiently narrow, so that it becomes 
practically a value at a fixed temperature. 


The modified method appears to possess an additional advan- 
tage of reducing the time of measurements to a single period, once 
the desired (mean) temperature has been attained in the whole 
specimen. 

What appears to be less widely known is that the Angstrom 
method is not the only one possessing the above advantages. The 
method based on the use of a moving heat source* offers the same 
advantages and it has the added advantage of covering wide 
ranges of temperature in one single run. Thus, even the waiting 
period for the attainment of a mean temperature is eliminated. 

The method is based on the establishment of a quasi-stationary 


state in a long bar swept by a moving heat source. During the’ 


sweep each point of the bar is subjected to a temperature rise on 
the approach of the source and temperature fall when the source 
moves away. It can be shown‘ that if a plot is made of the log 
temperature variation with time at a given point sufficiently 
removed from both ends of the bar, the slopes 8 and 8’ correspond- 
ing to the same temperature on the rising and falling portions of 
the plot are related to the thermal diffusivity a at this temperature 
as follows: 


a="/(8+/’). (1) 


Here 2 is the velocity of the heat source. 


LOG T (°F) 





TIME - SECONDS 


Fic. 1. Plot of time-log temperature data for 61S and 3S aluminum alloys. 


As an example, two typical plots of time-log temperature data 
are shown in Fig. 1. These data were taken from time-temperature 
records using equipment similar to that described by Sidles and 
Danielson.” 

Table I contains some data obtained by this method along with 
data secured by other methods or from other sources and included 
for the sake of comparison. With the present experimental tech- 
nique a reproducibility of 2 percent or better appears possible in 
all cases in which thermal diffusivity exceeds 0.2 cm?/sec. 

In spite of the advantages of both methods it is only fair to 
caution against undue emphasis on direct determinations of 


TABLE I. Thermal diffusivity of some alloys. 








Thermal diffusivity (cm?/sec) 





Moving heat source 





Temper- Num- Stand- 
ature ber ard 
Range of  devi- Other 
Alloy (°C) Value tests ation sources 





Yellow brass 0.346 0.365 
60/40 100-400 (1+0.000715T) 4 0.010 (1+0.001T)* 


Deoxidized 0.99> 
Copper 65.5-288 0.980 2 0.009 at 65.5°C 
Aluminum 

Alloy 61S 

Nominal 

Composition 0.6624 


0.673 
Mg-1.0, Cu-0.25 65.5-483 (1+0.0000107T) 31° 0.010 at 25°C 
Si-0.6, Cr-0.25 


Aluminum 

Alloy 3S 

Nominal 0.643 0.6054 

em 121-538 (1+0.0000523T) 7 0.014 at 25°C 
nil. 








® See reference 4, 18—200°C 

b Computed from electrical resistivity at 65.5°C using Franz-Wiedemann 
relation [C.S. Smith and Palmer, Trans. Am. Inst. Mining Met. Engrs. 117, 
225 (1935) ]. 

¢ Four source velocities and two specimen thicknesses. 

4 Metals Handbook (The American Society for Metals, Cleveland, 1948). 


thermal diffusivity. For one thing the knowledge of thermal 
diffusivity alone is not sufficient to solve transient heat problems 
in which thermal properties vary with temperature. The variation 
of heat conductivity with temperature also must be given.®:® 
For another thing, accurate methods for the determination of 
heat conductivity and heat capacity at various temperatures are 
available.':7§ Hence, thermal diffusivity for each temperature can 
be computed and the degree of accuracy appears to be at least 
equal, if not actually superior, to that afforded by direct methods. 

The use of direct methods is therefore a matter of expediency 
rather than necessity. This expediency, however, is greatly 
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justified by the present scarcity of complete sets of data on density 
and heat capacity over wide ranges of temperature for even the 
most common alloys.* 


* It might be mentioned that in certain cases it is possible to evaluate the 
dependence of thermal conductivity and heat capacity on temperature from 
the known temperature variation of another physical property, e.g., 
electrical resistivity. Such cases are of course limited. 

1H. S. Carslaw and J. C. Jaeger, Conduction of Heat in Solids (Clarendon 
Press, Oxford, 1948). 

? P. H. Sidles and G. C. Danielson, J. Appl. Phys. 25, 58-66 (1954). 

3D. Rosenthal, Trans. Am. Soc. Mech. Engrs., 68, 849-866 (1946). 

*D. Rosenthal and A. Ambrosio, Trans. Am. Soc. Mech. Engrs. 73, 
971-974 (1951). 

®*M. Jacob, Heat Transfer (John Wiley and Sons, Inc., New York, 1949) 
p. 192. 

*R. Plunkett, Trans. Am. Soc. Mech. Engrs., 73, 605-608 (1951). 

7G. J. Aitchison, J. Sci. Instr. 24, 200-202 (1947). 

*R. B. Rice, ASTM Bull. No. 157, 72-75 (1949), 





Multiple Beam Sharpening of Photoelastic Fringes 


DANIEL Post 
Naval Research Laboratory, Washington, D. C. 
(Received March 12, 1954) 


ONSIDERABLE attention has recently been given to 
mechanical techniques for increasing the precision of 
photoelastic measurements. Great strides have been made possible 
in this direction by improved techniques in model preparation, 
calibration, and fringe pattern analysis, and by compensator 
and photometric determination of fringe orders. The present 
contribution deals with a technique for transforming the shape of 
photoelastic fringes to make them more amenable to precise 
measurements. 

When partial mirrors of high reflectivity are placed on each side 
of a model, Fig. 1, part of the light traverses the model many times 
before emerging. For normal incidence these rays emerge from 
the same point, and the combined intensity varies with model 
birefringence according to the curve given in Fig. 2(a). The shape 
of the two beam interference fringes of conventional photoelastic 
isochromatic patterns is provided in Fig. 2(b) for comparison. 

These charts were obtained by replacing the model with a 
Babinet-Soleil Compensator in a standard circular polariscope, 
and imaging the compensator aperture upon the grid of a photo- 
multiplier tube. The compensator was driven at constant speed 
and the phototube output was amplified and recorded on a 
constant speed chart. The sharpened pattern was obtained by 
interposing mirrors of 85 percent reflectance (3 percent transmit- 
tance, 12 percent absorption) on each side of the compensator. 

Similar charts show that errors in the optical system, such as 
imperfect circular polarization and extraneous birefringence 
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Fic. 1. Light path through model and mirror system. Full and dashed lines 
represent light polarized in mutually perpendicular planes. 
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Fic. 2. Intensity curves for (a) sharpened photoelastic fringes (top) and 
(b) ordinary photoelastic fringes (bottom). 


outside the mirror surfaces, have no effect upon the positions of 
the sharpened peaks. These errors reduce the amplitude of the 
peaks and cause minor asymmetry in fringe shape. 

Rigorous mathematical analysis of this problem appears to be 
very difficult. However, a physical argument leads to the following 
intensity-retardation relationship : 

p=" 1 (1—R?) cos2aN | 

2 Li—R* 14+ R*—2R* cos4eN J’ 

where / is fringe intensity, Jo is incident light intensity, T and R 
are fractions of light intensity transmitted and reflected at each 
mirror surface, and WN is relative retardation, in wavelengths, 
caused by a single passage of light through the model. This is 
the exact solution for the summation of transmitted rays, assuming 
wave coherence for each pair of rays, A—A’, B—B’,---, arising 
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Fic. 3. Sharpened photoelastic fringe pattern for disk 
in diametral compression. 
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out of the birefringence, and noncoherence between other combina- 
tions of rays. The large distance between mirrors (in wavelengths), 
coupled with the finite band width of any monochromatic source, 
produces a randomization of phases having the essential properties 
and effects of noncoherent rays. 

The plot of Eq. 1 yields the same shape curve as given in Fig. 
2(a), but with somewhat sharper peaks. It should be remembered, 
however, that all imperfections in the experiment, such as finite 
size of source, finite spectral band, non-normal incidence, etc., 
produce some broadening of the sharpened fringes. 

Figure 3 shows a typical sharpened photoelastic fringe pattern— 
a circular disk in diametral compression. The distinct advantage 
over conventional patterns is apparent. Fringe positions can be 
established with far greater precision for these multiple-beam 
fringes than for the corresponding system of two-beam fringes. 

This technique should be most valuable for those problems 
where low fringe orders or low fringe gradients are important. A 
complete fringe distribution plot becomes possible even for these 
cases. Several sharpened fringe parameters can be observed and 
measured by utilizing monochromatic sources of different wave- 
lengths (such as the strong lines of the mercury arc). Further, 
the method of varying the magnitude of applied load for the 
purpose of displacing fringes to given regions becomes practical 
with the sharpened system. 





A Torsional Pendulum of Low Thermal Inertia* 
C. ANG AND C. WERT 
University of Illinois, Urbana, Illinois 
(Received March 23, 1954) 


HE application of anelastic methods to problems in metal- 
lurgy and the physics of solids has been in some instances of 
considerable value.+? Many of these applications have used, to 
measure the damping of the specimen, a simple torsional pendulum 
not much more complicated than the pendulum commonly used to 
measure rigidity modulus in physics laboratory courses.** Because 
of its simplicity, modifications in its use can easily be made to fit 
special needs of various problems. This account will describe a 
pendulum which was constructed for the study of damping in 
certain Au-Ni alloys.® 
In alloys of gold and nickel there is an interesting relaxation 
effect which is observed when the alloys are quenched from high 
temperatures, but this effect disappears fairly rapidly at tempera- 
tures which one must use to measure the relaxation. Hence, to 
study this effect properly, an apparatus with rather low thermal 
inertia is required. The pendulums commonly used in the past 
were unsatisfactory in this respect since heat transfer from the 
external furnace to the specimen was too slow.*‘* By the time a 
quenched specimen was mounted in the apparatus and heated to 
the proper temperature (some 300°C), the relaxation effect was 
nearly gone. Hence, some modification of the existing apparatus 
was required to eliminate this thermal sluggishness. 

A pendulum with low thermal inertia designed to measure such 
transient peaks is shown semischematically in Fig. 1. The specimen 
(in this case a 0.030 in. diameter wire) forms the suspension of a 
counterbalanced inverted torsional pendulum (counterbalanced so 
that the longitudinal stress on the specimen is small). The wire 
is heated by the Joule heat of a current supplied by a variable 
low-voltage ac source. Electrical contact is made directly at the 
bottom (fixed) end of the sample and through a pin dipping into a 
cup of gallium at the top (free) end. The heating rate can easily 
be made to be several hundred degrees C per minute; at any 
temperature the specimen can be maintained rather constant over 
long periods of time. 

The temperature of the specimen is measured with a thermo- 
couple spot welded directly to the wire. The thermocouple, which 
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Fic. 1. A schematic drawing of the principal features of the 
low-thermal-inertia pendulum. 


is made of wire of 0.002 in. to 0.010 in. diameter, is made to have 
many long, loose spirals and loops; it then adds no appreciable 
damping. The most serious defect of the apparatus lies in the 
nonuniformity of temperature along the length of the wire. The 
temperature falls off slowly at first from the middle to the two 
ends, then rapidly toward the grips which act as rather large heat 
sinks. For a typical 10-cm specimen nominally at 350°C, a series of 
0.002 in. Cu-constantan thermocouples showed the central 7 cm 
to be within 10°C of 350°C, and the last 13 cm at each end to have 
a steep gradient down to the grips. By using a 0.010-in. thermo- 
couple at the center to act as a slight heat sink at that point, the 
central 7-cm section was made uniform to 5°C at 350°C. 

This nonuniformity of temperature is much greater than would 
be tolerated in an apparatus using an external heating element, 
but does not actually seem very detrimental. Well-known relaxa- 
tion peaks in Ta were observed to be shifted slightly to higher 
temperatures (a few degrees) and slightly broadened. These effects 
could be reduced either by using longer specimens or by heating 
the grips in some fashion. 

A good vacuum can be maintained inside the metal bell to 
reduce oxidation at high temperatures. With the bell removed, the 
entire apparatus is easily accessible. Ferromagnetic specimens 
must be examined with caution, since large damping of magnetic 
origin may be observed with high ac currents needed for heating 
(up to 20 amperes). This effect would apparently be eliminated by 
using a dc heating current. 

on work was sponsored in part by the Office of Ordnance Research, 
. . pa Anelasticity of Metals (University of Chicago Press, Chicago, 
me Nowick, prevent in Metal Physics (Academic Press, Inc., New 
York, 1953), Vol. 4 
3T. S. Ké, Phys. 4 “71, 533 (1947). 
4C. Wert, Modern Research Techniques in Physical Metallurgy (Am, Soc. 


Metals, Cleveland, Ohio, 1953), p. 225. 
5 C. Ang and C. Wert (to be published). 
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Maxwell’s Demon Must Remain 
Incommunicado 
MARCEL J. E. GoLay 


Signal Corps Engineering Laboratories, Fort Monmouth, New Jersey 
(Received April 2, 1954) 


ZILARD' has indicated that Maxwell’s Demon cannot 
function because he lacks the information required for the 
operation of his trap door. This concept has been further explored 
by Brillouin? who has shown that the increase of entropy incidental 
to the operation of a sensing system for gathering the information 
needed about molecule movements exceeds the negentropy thus 
acquired for the Demon. These studies thus dispose effectively of 
any speculation that the entropy of information theory is different 
from that of thermodynamics, except for the factor —&lg,2. 

The purpose of this letter is to point out that even if the 
molecule sensing system were to operate at the expense of no 
entropy increase, the very process of communicating this informa- 
tion to the Demon would require too great an entropy increase. 

This can be shown very simply when the communication system 
consists of an ordinary transmission channel with negligible 
losses. The optimum information rate of such a channel is given 
by the now classical expression, Wigl1+(P/kTW) ], where W is 
the band width utilized, P is the signal power, and kTW is the 
noise power. For small powers, this formula indicates that the 
information entropy S; communicable for an expenditure of 
energy E, is related to the latter by the expression 


Si (E/RTge2). 


The thermodynamic negentropy communicable to the Demon 
is therefore limited to 


—Smn< (E/T), 


i.e., is smaller than the entropy increase which occurs when the 
electrical energy E utilized by the communication system is 
dissipated into heat at the temperature 7. 

It would be equally futile to attempt to communicate with the 
Demon by means of, say, a high-energy photon dispatched at one 
of many prearranged epochs (pulse time modulation). The rate 
of blackbody emission of photons in the frequency range Av from 
an area of the order of a half wavelength square is given by 
Av /e”/*T 1. The precision with which the time of arrival of a 
photon within the prearranged frequency range Av can be deter- 
mined is of the order of 1/Av, and the natural logarithm of the 
number of small time intervals of this order into which the 
large time intervals between photon transmissions can be divided 
should be of the order of Av/kT, in order to transmit information 
with the same energy economy as with a conventional communica- 
tion channel. This indicates that the rate at which photons should 
be transmitted Ave~’”/*7 is of the same order of magnitude as the 
blackbody emission from the smallest area on which photons can 
be concentrated. 

We must conclude, therefore, that the very postulation of a 
communication system, either with a stupid Demon who obeys 
orders only, or within a clever Demon between his molecule 
sensing nerves and his motor nerves, destroys his reason for being. 


1L. Szilard, Z. Physik 53, 840-856 (1929). 
?L. Brillouin, J. Appl. Phys. 22, 334 (1951). 





The Dynamic Mechanical and Electrical Behavior 
of Polymethyl Methacrylate 


Davip TELFAIR 
Earlham College, Richmond, Indiana 
(Received January 28, 1954) 


URING the past ten years a considerable interest has 
developed in the dynamic mechanical behavior of polymers. 
Studies in this field are not only of practical value in the rubber 
and plastics industries"? but are also of theoretical interest because 
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Fic. 1. Dielectric constant of polymethyl methacrylate vs temperature. 


they lead to a better understanding of the molecular processes 
which underlie the response of polymers to mechanical and 
electrical stress.3:4 

We have studied the effect of temperature on the mechanical 
damping capacity and modulus of elasticity in shear for polymethy! 
methacrylate by means of the torsional pendulum method 
described by Nielsen.’ Dielectric constant and power factor 
measurements within the temperature and frequency ranges 
indicated in Figs. 1 and 2 have also been made. 

Samples of the unplasticized polymer from two independent 
sources* were studied. Figure 3 illustrates the dependence of 
dynamic shear modulus and of mechanical damping capacity with 
temperature. Frequency for the torsional vibrations was held 
nearly constant (approximately 0.20 cycle per second). 

The dielectric data shown in Figs. 1 and 2 were obtained witha 
General Radio type 716-C capacitance bridge. A two-terminal 
dielectric sample holder was used, similar to the General Radio 
type 1690-A holder. Mineral oil from a thermostatically controlled 
bath was circulated through the grounded terminal and walls 
of the holder, in order to control temperature. The data shown in 
Figs. 1 and 2 are for a predried specimen of “heat-resistant” 
Lucite (supplied by du Pont), but are also characteristic of the 
corresponding grade of Plexiglas (supplied by Rohm and Haas). 

A few years ago Alexandrov and Lazurkin® published data on 
the response of polymethyl methacrylate to dynamic stressing 
ar various temperatures and frequencies. Fitzgerald and Ferry’ 
have found that an activation energy calculated from electrical 
data for polyvinyl chloride is approximately the same as that 
calculated from mechanical damping data. An activation energy 
for polymethyl methacrylate estimated from the data of Alex- 
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Fig. 3. Dynamic shear modulus and mechanical damping of 
polymethyl methacrylate vs temperature. 


androv and Lazurkin is not in satisfactory agreement with the 
value calculated from the electrical data indicated in Figs. 1 
and 2. 

McLoughlin and Tobolsky® have madea study of the viscoelastic 
properties of polymethyl methacrylate by means of stress relaxa- 
tion methods. Their data clearly exhibit the relatively strong 
dependence of the dynamic mechanical properties on moisture 
content. Some preliminary work in this laboratory indicates that 
the dielectric behavior is also sensitive to moisture. The results of 
further investigations will be reported later. 


* The Rohm and Haas Company, and the E. I. du Pont de Nemours & 
Company, Inc. 

1L. E. Nielsen, J. Am. Chem. Soc. 75, 1435 (1953). 

2 W. J. Lyons and I. B. Prettyman, J. Appl. Phys. 19, 473 (1948). 

3 Ferry, Grandine, and Fitzgerald, J. Appl. Phys. 24, 911 (1953). 

4 Fitzgerald, Grandine, and Ferry, J. Appl. Phys. 24, 650 (1953). 

5L. E. Nielsen, Am. Soc. Testing Materials Bull. No. 165, 48 (1950). 

6A. P. Alexandrov and J. S. Lazurkin, Acta Physicochim. (U.S. S. R.) 
12, 647 (1940). 

7E. T. Fitzgerald and J. D. Ferry, J. Colloid Sci. 8, 1 (1953). 

8 J. R. McLoughlin and A. V. Tobolsky, J. Colloid Sci. 7, 555-568 (1952). 





Hardening of Pd by Cathodic Hydrogen 


TAKESI SUGENO AND MASAMICHI KOWAKA 


The Institute of Industrial and Scientific Research, 
Osaka Univer sity, Sakai-shi, Osaka, Japan 


(Received April 2, 1954) 


HEN palladium absorbs hydrogen, the two phases, a and 

8, are known to form depending upon the concentration 

of hydrogen, and they are both of f.c.c. lattices but differing in 

lattice parameters and exist as segregated phases.! It is also known 

that Pd is hardened by the hydrogen absorption. We have meas- 

ured the hardness of the Pd specimens which were charged by 

cathodic hydrogen, with the micro Vickers hardness tester, and 

have obtained several interesting results in regard to hydrogen 
hardening processes. 

Figure 1 shows the relation between the hardness of Pd and 
the duration of hydrogen charging, where the curve a is for the 
perfectly annealed (1200°C —3 hrs) specimen, }, c, and d for the 
lightly, mediumly, and severely cold worked specimen, respec- 
tively. In the case of cathodic charging by the current density 
of 3.6 mA/cm?, it is shown from the electric resistance measure- 
ment that the a phase exists up to 10 min of charging and there- 
after it transforms to the 6 phase, and also shows that some 
curves have maxima and others do not. All curves in Fig. 1 show 
the increasing hardness during the initial ten min of charging. 
When hydrogen was charged long enough, the hardness reached a 
definite value. We know, therefore, that the increasing part of the 
hardening curve corresponds to the a phase and the part of a defi- 
nite value to the 8 phase. The final hardness value of the severely 
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cold worked specimen is nearly equal to the initial value, but 
other specimens have higher values of the final hardness than 
the initial. 

Figure 2 shows the change of hardness of the hydrogen-charged 
Pd specimen with discharging times, where the reversed electroly- 
sis is used for discharging. The initial state of specimens is the 
perfectly annealed. In the case of the a phase, the hardness 
decreases monotonously and quickly reaches its initial value, 
while the curve of the 8 phase once ascends with discharging times 
and reaches a definite value (higher than the initial value) through 
a maximum after a long time. 

From these experimental results, it becomes evident that the 
a and the 8 phase may have different mechanisms of hydrogen 
hardening. The hardness of the a phase increases when the 
hydrogen content increases, and reversibly decreases with de- 
creasing hydrogen content. After discharging had been finished, 
the specimen was again heated at 1200°C and microscopically 
investigated. We could not detect any observable change in its 
structure. We know, therefore, that the hydrogen hardened a state 
completely recovers to its initial annealed state only by discharging 
the hydrogen. 

The hardening mechanism of the 8 phase may be somewhat 
complicated. When the specimen was charged with hydrogen to a 
8 range, the slip-band-like traces appeared in the grain structures. 
These traces remained after the hydrogen had been perfectly 
discharged. The reason why the hydrogen hardened 8 phase 
specimen does not recover its initial softness after discharging 
hydrogen can be explained by these traces. After complete 
discharging, in this case, the hydrogen hardened Pd could be 
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recrystallized at 700°C. The essential nature of these traces and of 
the 8 phase are now under investigation. Some tentative explana- 
tion about the mechanism of hydrogen hardening phenomena in a 
8 range and the details of experimental results will be shown in 
another paper.” 

1D. P. Smith, Hydrogen in Metals (University of Chicago, Chicago, 1948). 


2 T. Sugeno and M. Kowaka, Mem. Inst. Sci. and Ind. Research, Osaka 
Univ. XI (1954). 





Elastic Relaxation and Grain Boundary Diffusion 
in Cadmium 
Y. L. Youser anp R. KAMEL 


Physics Department, University of Cairo, Giza, Egypt 
(Received April 7, 1954) 


NTERNAL friction and Young’s modulus relaxation, in 

relation to intergrain diffusion, were investigated for pure 
cadmium in the polycrystalline and in the single crystal states. 
The specimens were in the form of short wires contained in an 
evacuated tube which could be maintained at various tempera- 
tures. Transverse vibrations with amplitudes of the order of 
10~* cm at low sonic frequencies were set up electrostatically and 
were detected by a capacity-responsive circuit which has earlier 
been described in connection with sodium ion diffusion in glass." 
The internal friction Q-' was measured by the resonance curve 
method. Young’s modulus E was 4etermined at various tempera- 
tures simultaneously with the measurement of internal friction. 
The square of the frequency of vibration was taken as a measure 
of Young’s modulus. 

In conformity with the poineer work of K@ on aluminum by a 
torsion pendulum method, the internal friction versus temperature 
curve for polycrystalline cadmium showed a marked absorption 
band due to relaxation by intergrain diffusion. A relaxation in 
Young’s modulus versus temperature curve manifested itself as an 
inflection in the curve, and occurred in the same temperature 
range where the absorption band appeared (Fig. 1). In single 
crystal specimens prepared by the moving furnace method,’ a 
regular behavior was observed, indicating the absence of any 
relaxation process under the experimental conditions contemplated 
here. Since the observed absorption band and the relaxation of 
the elastic modulus in the polycrystalline cadmium disappeared 
completely in the case of single crystal specimens, the mechanism 


responsible for the present anelastic effect in polycrystals must be 


the diffusion of grain boundaries. 
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Fic. 1. Internal friction (@,O) and elasticity relaxation (X<,A) curves 
for cadmium in the ycrystalline (I and II) and in the monocrystalline 
(III and IV) states. The frequency at room temperature for curves I and II 
is 41.5 c/s and for curves III and IV is 48 c/s. 
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Fic. 2. Effect of change of temperature on the position of the internal fric- 
tion peak of two cadmium strips (8.35 X 0.125 X 0.078 cm and 2.65 X 0.125 
X< 0.078 cm). The frequency at room temperature for curve A is 41.5 c/s 
and for B is 385 c/s. Curve C represents the position of the peak as a 
function of frequency. 


The heat of activation H associated with grain boundary diffu- 
sion was deduced from Fig. 2 and was found to be 32 kcal/mole. 
According to Le Claire,‘ the value of H may be taken, without 
appreciable error, as equal to its value for self-diffusion. So far 
as we are aware, it seems that no results were previously published 
on the self-diffusion of cadmium. 

1 R. Kamel, J. Appl. Phys. 24, 1308 (1953). 

2 T. S. Ké, Phys. Rev. 71, 533 (1947); 72, 41 (1947); 73, 267 (1948). 

3 E. N. da C. Andrade and R. Roscoe, Proc. Phys. Soc. (London) 49, 152 


(1937). 
‘A. D. Le Claire, Phil. Mag. 42, 468 (1951). 





The Measurement of Electromagnetic Field in 
Resonant Cavities by Introduction of 
Small Metallic Rings 


ALDO GILARDINI 
Istituto Superiore PP.TT., Rome, Italy 
(Received April 5, 1954) 


N a recent paper' published in this journal, L. C. Maier and 

J. C. Slater have shown that the electromagnetic field in a 
resonint cavity can be determined by measuring the perturbation 
of the resonant frequency, caused by the insertion of a small 
metallic element at the point where the electromagnetic field 
has to be measured. The frequency shift is given by an equation 
which can be stated in the form? 


= 
ff. wile MaBA+ ME, M2E?) 


iy 
—y(N,H2+N,HZ+N2H?2)], 


where fo is the unperturbed resonant frequency, f the perturbed 
frequency, U the total electromagnetic energy stored in the 
cavity, E,, Ey, E., Hz, Hy, Hz, the six components of the electro- 
magnetic field at the point of perturbation in an arbitrary Cartesian 
coordinate system, M,, My, M., Nz, Ny, N. parameters having 
volume dimensions, related to the size and shape of the element 
and to its orientation in the coordinate system. (It is interesting to 
note that «M,E; and —yN;H; are the components along the i axis 
of the electric and magnetic moment of the perturbing element.) 

In order to determine the six components of the electromagnetic 
field a system of six equations is required; this system can be 
obtained by six frequency shift measurements, each with different 
shape or orientation of the perturbing element, supposing M and 
N parameters known. 

Maier and Slater have calculated these parameters for an 
ellipsoid of revolution and found good agreement with experiment 
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in the extreme cases of spheres and very thin disks. In the general 
case the agreement is much less satisfactory, probably, as the 
authors suggest, owing to the poor tolerances in machining an 
ellipsoid. 

By using spheres and very thin disks with different orientations 
obviously only four independent equations can be obtained and 
for the remaining two, general ellipsoids must be used. In order to 
improve the precision I have tried to substitute to the ellipsoid, in 
the last two measurements, another metallic element of a shape 
which can be more easily machined. A toroidal ring seems easier to 
machine and I have considered the possibility of using it. 

Let us assume that the rotation axis of a toroidal ring is directed 
along the z coordinate axis, and let r be the radius of the generating 
circle and R the radius of the circumference described by the 
center of the generating circle. M and N parameters can be 
calculated following the method illustrated by Maier and Slater 
for ellipsoids. The results are* 


2)3 
M,=4 RO?) p () 


(R/r)? 
—r*)i R 
aaa af r(® ) 


tt aoe —P)i(R 
N= Ny= 3p (R/r)? r(*) 


where F,, F2, and F; are functions of R/r, which have unity 
asymptotic values when R/r—~. F functions can be given as ex- 
pressions containing associated Legendre functions of semi-integer 
degree and have been calculated and plotted in Fig. 1 for 
R/r2>2. 

It is interesting to note that the relationship 4,= M,=2N, is 
also valid for ellipsoids. 

In Table I are reported the results of an experimental verifica- 
tion. Two tori have been used, the first with R=3.65 mm, 
r=0.73 mm, the second with R=3.09 mm, r=1.51 mm. They 
have been inserted in a rectangular cavity, operating in the TEjo« 


M,=M,=2N,=282————. 


TABLE I. 





Experimental 
value of Calculated 








parameter value Percentage 
Torus Parameter _f Ma.) (mm!) (mm!) of error 
5° M; me 91 642 627 2.4 
1° M: —0.48 79 76.5 3.3 
Ba Nz +0.19 59 56.6 4.2 
ge N: +0.98 303 313 3.2 
al M: —5A4l 888 867 2.4 
r M: —1.66 272 275 1.1 
2° Nz +0.62 192 195.5 1.8 
7 N: +1.36 420 434 3.2 
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mode (fo=3040 MHz), at the points of maximum electric field 
and of maximum magnetic field. Estimated error in Af is +10kHz, 
which is larger than Maier and Slater’s inaccuracy, owing to a 
different method of measuring Af, but is good enough for our 
purposes. The percentage errors reported in the last column 
show that perturbation theory for toroidal rings checks to within 
a few percent, as Maier and Slater already found for spheres and 
very thin disks. 

The writer wishes to express his hearty thanks to Professor 
V. Gori, Director of the Istituto Superiore PP.TT., for his kind 
encouragement and support in this study. 

1L. C. Maier, Jr., and J. C. Slater, J. Appl. Phys. 23, 68 (1952). 

2H. B. G. Casimir, Philips Research Reports 6, 162 (1951). 

3 Calculations are developed in a report of the Istituto Superiore PP.TT., 


which the author will be glad to send on request. Requests can be addressed 
to him: Viale Trastevere 189, Rome, Italy. 





Centripetal and Centrifugal Electromagnetophoresis 
ALEXANDER KOLIN* 
University of Chicago, Chicago, Illinois 
(Received April 7, 1954) 


T has been reported in a previous paper! that particles sus- 
pended in a liquid whose electrical conductivity differs from 
that of the particles experience a force when a current is passed 
through the liquid at right angles to a magnetic field which 
permeates the liquid. To avoid convection of the fluid, the current, 
as well as the magnetic field, must be homogeneous. The magnetic 
field of flux density B and the current i may either be constant or 
alternating; in the latter case a maximum force is obtained when 
B and i are in phase. The direction of the force F is perpendicular 
to B and j (j being the current density in the suspension). Par- 
ticles whose conductivity (0’) exceeds that of the liquid (¢’’) 
experience a force in the direction of the Lorentz forces exerted 
upon the moving ions, whereas particles of o’<o”’ experience a 
force in the opposite direction. Particles of o’=0” experience no 
force. 

The phenomenon is the result of a combination of body forces 
and surface forces exerted upon suspended particles subjected 
to the above-mentioned conditions. The body force is an electro- 
magnetic force exerted upon the current passing through the 
particle; the surface force is a combination of hydrostatic forces 
(brought about by a hydrostatic pressure field set up in the 
fluid in the electromagnetic field) with hydrodynamic forces 
manifested as a result of fluid streaming engendered in the fluid 
adjacent to the particle. The situation is roughly analogous to 
particles suspended in a field of gravity, where the electromagnetic 
body force is the analog of the weight and the surface force is 
that of the buoyancy. 

A theoretical estimate of the force upon small spheres, taking 
viscous drag into account,’ predicts a force 


F=S(Bxiv[ 2", (1) 


where V is the volume of the sphere. 
For nonconducting spheres suspended in a conducting fluid 
(o’<e"’) one obtains, thus, 


F=5(BxilV. (2) 


This expression has been verified by observing the rate of 
migration of polystyrene spheres of diameters in the order of 
magnitude of 100 « suspended in NaCl solutions. 

The main experimental difficulty in observing this effect 
consists in the exclusion of electromagnetically engendered 
convection. The slightest nonuniformity in the magnetic field or 
current density may give rise to liquid convection which could 
mask the effect. The cell containing the suspension must, therefore, 
be made with great precision so as to minimize variations in the 
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Fic. 1. Generator of solenoidal 
magnetic field and cylindrical electro- 
magnetophoresis cell C. W: Thin 
water-cooled copper tubing. 7: and 7:: 
Heavier copper tubing (# in. od; 

in. id) conveying the current to W. 

1 and Bz: Heavy copper bars (1 in. in 
diameter, 32 in. long connecting 7: and 
T: to the secondary of the stepdown 
transformer. (Distance between B: and 
B:: 30 in. Length of W: 10 in.). E: and 
E:: Electrodes. j: Current density 
vector parallel to W. F: Vector repre- 
senting the centripetal force exerted 
a suspended nonconducting par- 
ticles. 





cross section perpendicular to j and the greatest care must be 
taken to maximize the uniformity of the magnetic field. 

Because of the practical difficulties involved in obtaining 
intense magnetic fields of great uniformity, it appeared desirable 
to investigate the possibility of observing the effect with non- 
uniform fields. This is indeed possible by using combinations of 
electric and magnetic fields which satisfy the condition curl 
[jX B]=0. A field combination which appears to be especially 
favorable consists of the solenoidal magnetic field surrounding a 
very long wire (carrying a current 7) with a current of uniform 
density j traversing the cell parallel to the wire (Fig. 1). The 
electromagnetic forces exerted upon the fluid as well as upon the 
particles, being perpendicular to both B and j, are radial. Thus, 
suspended particles experience centripetal or centrifugal forces, 
depending on the phase relation between B and j. For particles 
which are small as compared to the radius of the wire W, Eq. (1) 
can be assumed to hold with a good degree of approximation. 

The arrangement shown in Fig. 1 enables one to obtain intense 
magnetic fields within a small volume adjacent to the surface of 
W, which is adequate for work with small particles. Currents of 
several thousand amperes can be readily obtained with powerful 
stepdown transformers. Magnetic fields of sufficient symmetry 
can be obtained by making wire W part of a very large rectangular 
loop which is energized by the stepdown transformer. The 
“magnet”? W can be conveniently cooled by using copper tubing, 
rather than a solid wire. In our experiments currents of up to 
about 3500 amperes were passed through a copper tubing of 
is in. outside diameter and ,y in. inside diameter. Thus, the value 
of H at.the surface of the wire was approximately 2800 oersteds. 
The tubing W was cooled by tap water passing through it at the 
approximate rate of 12 liters/minute. j varied from 0.1 amp/cm? 
to 1 amp/cm*. For example, suspensions of polystyrene spheres 
of average diameter of 115y (ranging from 25y to 400u) have been 
“centrifuged” successfully. At a current of J=3050 amp passing 
through the tubing W with a current of density j=0.65 amp/cm? 
traversing the suspension, the precipitation around the central 
tubing W was nearly complete (except for a few spheres of the 
smallest size) within 5 seconds and practically complete within 10 
seconds, In dense suspensions the deposit of particles will be heavy 
and not uniform. The nonuniform deposit creates an electro- 
magnetic convection in its vicinity which tends to redistribute the 
particles uniformly so that the convection slows down and finally 
stops. 

It is not necessary to use a cylindrical cell surrounding W as 
shown in Fig. 1. Cells of a great variety of forms can be used. A 
semicylinder, or better, a cell of the cross section shown in Fig. 2 


Fic. 2. Cross section of 
semicylindrical electromag- 
netophoresis cell. L: Lucite 
“washer,” a transparent 
wall (S mm high) sand- 
wiched between two silver 
electrodes. (The electrodes 
are parallel to the plane of 
the drawing and are not 
shown in the figure. They 
are cemented, for greater 
rigidity, to Lucite plates.) 
I and O: Channels through 
which the cell can be filled. 
F: Flat walls through which 
the migration of the par- 














' ‘ 

' F iW, F ticles toward W or away 

\ — | | from it may be observed. 
46 | | W: Copper tubing which 

| 3" serves as electromagnet. 

' Ky ! 


is particularly convenient. W indicates the location of the wire 
which generates the magnetic field. The transparent Lucite 
“‘washer” L shown in Fig. 2 is sandwiched between two similarly 
shaped silver electrodes. The migration of the particles toward the 
center of the wire W or away from it can be easily observed 
through the flat Lucite walls F of the cell. It is important to align 
the planes of the electrode, E, and E; (Fig. 1) exactly perpendicular 
to the wire W in order to minimize electromagnetic convection. 
Thin silver electrodes (3/1000 in.) have been used to avoid 
excessive heating of the electrodes by eddy currents thus reducing 
thermal convection. 

The author wishes to express his indebtedness to the Research 
Corporation and to the Smith, Kline and French Foundation for 
generous support of his research. Thanks are also due Mr. Edward 
Wolowiec for technical assistance. 

* This work has been supported by a grant from the U. S. Office of Naval 
Research. 


1A. Kolin, Science 117, 137 (1953). 
? D. Leenov and A. Kolin, J. Chem. Phys. 22, 683 (1954). 





On Shock Propagation in Brass 
H. BEcKER* 
Nuclear Power Division, Combustion Engineering, Inc., New York, New York 
(Received May 20, 1953) 


N a recent paper, Shreffler and Deal' (SD) presented velocity- 
thickness data for the free surface motion of brass plates 
under explosive detonation. Savitt? (S) plotted the reciprocal of 
the initial free surface velocity from (SD) as a function of plate 


thickness, and obtained a sharp discontinuity at about } in. 











e ITA 
\" 






































mm ys. v 
3 <= 
oa Oa 
“ha 
'S 2 4 t ins «f 4a 


Fic. 1. »-¢ plot. 





sity 
1 


liq 


for 
pr 
(c 
ele 
tic 








lar 
yn. 
id 
ng 


for 
rd 


val 


ork 


ity- 
ites 
| of 


late 








LETTERS TO THE EDITOR 1067 

















Fic. 2. Velocity zones. 


thickness. Actually, a clearer picture of the behavior of the plates 
may be obtained by plotting all (SD) velocities against plate 
thickness. This has been done in Fig. 1, from which it is apparent 
that several curves may be drawn connecting the velocity values, 
thus yielding lines of demarcation for velocity zones in the plates. 
This has been done in Fig. 2 for which the fine lines were taken 
from Fig. 5 (SD). The demarcation zones are clearly indicated, 
thus revealing cut-off thickness values at which velocity ranges 
in the thicker plates are no longer possible in the thinner plates. 
It would appear, from the sharp slope changes of the (SD) plots, 
that the velocity zones are defined by specific deflection modes, 
some of which may exist in thick plates but may not exist in thin 
plates. Furthermore, it could be presumed that a sharp change in 
slope of the line for any particular thickness may be due to a 
change in mode shape. 

* Now at New York University, Engineering Research Division, Univer- 
sity Heights, New York. Revised manuscript received on May 20. 


1R. G. Shreffler and W. G. Deal, J. Appl. Phys. 24, 44 (1953). 
2 J. Savitt, J. Appl. Phys. 24, 1335 (1953). 





On the Conduction of Heat into a Growing 
Vapor Bubble 


H. Kurt FORSTER 


Department of Engineering, University of California, 
Los Angeles, California 


(Received Apri! 16, 1954) 


N a recent paper on the integration of Rayleigh’s equation for 
the problem of a vapor bubble growing in a superheated 
liquid we used a simple expression for the temperature of the 
moving bubble wall with very good success.! An exact analysis is 
forthcoming but in view of possible further applications to related 
problems, particularly to the problem of the collapsing bubble 
(cavitation) and the bubble: growing under the influence of 
electric charges due to ionizing radiation the detailed considera- 
tions which led us to our formulation may be of interest. 
The two-phase heat conduction problem with phase boundary 
moving in consequence of the phase change is quite old and was 


first solved for some special cases by Franz Neumann as reported 
in Riemann’s lectures.? Carslaw and Jaeger who also substantially 
contributed to this problem furnish a rather complete list of 
references.* A paper dealing with the case of a moving spherical 
boundary between liquid and vapor was published a few years ago 
by Plesset and Zwick,‘ who assumed a “thermal boundary layer” 
around the vapor bubble and obtained a first approximation for 
the bubble temperature in a method of successive approximations. 
In our present treatment, we use Lightfoot’s® idea of representing 
the heat liberated or absorbed in phase change by sources or 
sinks, but instead of solving the heat conduction and (heat) 
convection problem at once and together we solve the more 
important of the two problems (heat conduction) first and 
exactly, and then, having a general solution of the head conduction 
problem valid for all bubble radii, we investigate the influence of 
the radial velocity on the temperature at the phase interface. 

Our system consists of a vapor bubble of radius Ro embedded 
in an infinite fluid. Specific heat, thermal conductivity and 
diffusivity and heat of evaporation will be denoted by c, K, x, 
and L and subscripts one and two will indicate the vapor or liquid 
phase, respectively. Initially, liquid and vapor are at the same 
temperature (boiling point plus superheat) which is taken as the 
zero of our temperature scale. 

If the radius of the bubble increases by dRo, an amount of heat 
dQ=Lp:4rR*dRp is absorbed constituting a negative distributed 
spherical surface heat source. The heat of evaporation is so large 
that its influence is the only important contribution toward the 
subsequent temperature distribution. To orient ourselves about 
the nature of the problem we shall first disregard the bubble wall 
curvature; this is a useful consideration because the essential 
features can be seen very clearly for the plane interface and, as 
will be shown later, consideration of curvature merely introduces 
some rather simple changes. 

We assume the phase interface in the (y,z) plane with vapor 
for x<0 and liquid for x>0; the initial condition is T;=7,=0 
except for x=.’ and the boundary conditions are 


T,=T>2 and gin x2! at the interface. (1) 
on, One 


The temperature distribution due to an instantaneous distributed 
plane source dQ’=dQ/4rR¢? at x=2’ is then® 


en Oe. a 
aT (x1) = | ex B(x—x’)*) 
Koxit— Kixo* 
Koxi'+Kixot 


where B= (4xt)~! [we should correctly divide by c2p2; lest the 
formulas become too cumbersome we have done so only in our 
final Eqs. (9) and (10) ]. In our case the heat source dQ’ is at the 
interface, thus x’=0; and in order to obtain the temperature at 
the bubble wall (which we shall always denote by T without 
subscript) we set x=0, giving 


T= aad 
(x 


1 
xl)» Ey) (3) 
14-4 & 
TK 1 


The constant factor which appears in Eq. (3) is very close to 
1, e.g. for water at about 100°C it is (1.0032)~'. Its physical 
significance may be found in the fact that the ratio Kix2#/K2x1:1 
(which is here numerically about 3: 1000) gives the proportion of 
heat flowing into the source at the bubble wall from the vapor 
region to that coming from the region filled with liquid. 

If we now consider the (small) spherical bubble, this ratio will 
become even smaller because the total heat contained in the 
extremely small initial bubble (its radius is about 10~* cm for a 
few degrees of superheat) is vanishingly small and then, as the 
bubble grows, the vapor emanating from the bubble wall by 
evaporation is below the initial temperature of the system. We 
shall therefore choose as our boundary condition at the bubble 
wall of the spherical bubble: no heat flux across the phase interface. 


exp(—a(et+2'))], (2) 
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Our problem is therefore to find the temperature distribution in 
the liquid due to a spherical surface heat source of strength 1 at 
R’ with the boundary condition 


(me. r,=9; Ro SR’; T+ finite at infinity. (4) 

The solution of the problem is the Green’s function for the 
domain for the stated conditions; it does not seem to be given in 
the literature and we derived it along lines given in various books 
for similar problems.’ The Laplace transform 7: of the tempera- 
ture in the liquid 7: is obtained as 


T.= (eRR'x:q)~+| exp(—q| R—R’}) 


qRo—1 
qgRo+1 


for Ro <R< R’; q stands for (p/x2)!, and p is the transformation 
variable in the usual notation. The inverse transform of Eq. (5), 
which may be obtained from tables, yields the temperature in 
the liquid. In our case we wish to know the temperature at the 
phase interface (and therefore we let R approach Ro) when 
evaporation takes place at the interface (R’—Ro) which gives us 
the transform of the bubble wall temperature due to a unit heat 
source 


oa 


exp(q(2Ro—R’— »)| » (5) 


T= lim lim T,=[4rRoxee(gRo+ 1) J". (6) 
R’—R, RR 
The temperature of the bubble wall of a bubble of radius Ro 
due to a source dQ at Ro then becomes 


dQ _ dQerfcl (x2t)!/ Ro] exp (xat/Ro*) 
4eRe (wxot)! 4rRo ; 


We note that in consequence of the factor /! in the denominator 
the important contributions to the temperature take place in the 
neighborhood of /=0; in fact, if we take the ratio of the second 
to the first term in Eq. (7) and write x for the dimensionless 
quantity (x2t)!/Ro we see that it is 


dT= 





(7) 


o(x) =x e”* erfcx, (8) 


which is zero for x=0. Let us further note that the first term in 
Eq. (7) is identical with the solution for the plane given by Eq. 
(3) for K,=0, and that the ratio given by Eq. (8) remains small 
for longer times as we take larger radii. 

If we now consider the influence of the phase boundary motion 
on the bubbie wall temperature we see directly that for the plane 
this influence is nil, i.e., this temperature is given by Eq. (3) or 
the first term of Eq. (7) whether the phase boundary moves or not. 

The physical significance of the dimensionless quantity x may 
be seen from the fact that (xsé)! is essentially the distance traversed 
by the temperature wave* and < is therefore this distance over the 
bubble radius. This consideration gives us a means to distinguish 
between slow and fast growing bubbles, depending on whether the 
bubble growth velocity is smaller or larger than the velocity of 
the temperature wave. In our case, for a few degrees of superheat' 
the bubble growth velocity is larger than the velocity of the 
temperature wave by a factor of about ten. More detailed con- 
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siderations® show that +x, starting from zero will remain smaller 
than about 0.05 during our time interval of interest which is less 
than 0.1 second. The ratio g(x) then remains smaller than 0.1 and 
we may neglect the second term in Eq. (7) compared to the first 
since we thereby introduce a small error in a small quantity. 

Concerning the influence of radial motion of the bubble wall on 
the first term in Eq. (7) we first note that the bubble, if left at 
radius Ro will at time ¢ have a temperature ~1/R,*; if, however, 
immediately after liberation of dQ the bubble is expanded to 
radius R, then its temperature at time / will be ~1/R? [and it 
will be given by the first term of Eq. (7) with Ro replaced by R]. 
In reality the bubble wall has in time ¢ moved from Ry to R anda 
mean value of 1/R? inserted in Eq. (7) should furnish a good 
approximation to the actual temperature. To obtain such a 
mean value we integrate 1/R? from Ro to R and divide by the 
interval; this gives 1/RRo. This value, inserted into the first term 
of Eq. (7) together with the expression for dQ as given previously 
yields 


Lp: RoR 
oo Pt od Ro . (9): 
Cop2R (t) (arxet)* 
The temperature which the bubble attains in consequence of the 
evaporation taking place continuously while it grows from Ro to 
R is now simply the sum over all the contributions given by 


Eq. (9) 
ral tR(x)R(x)dx 
c2p2(mx2)* J R()(t—2x)* 


Equation (10) gives the temperature of a (fast) growing spherical 
vapor bubble whose center is at rest in an infinite liquid. 

The temperature integral just derived is identical with the 
expression used in the treatment of the bubble growth problem in 
reference 1. It is interesting to point out that the bubble growth 
problem is quite insensitive to the exponent of (Ro/R) in Eq. (10). 
In fact, if [R(x)/R(d)]" (n a positive integer) is used in Eq. (10), 
the exponent m (a) leaves the solution of the bubble growth integral 
equation in the vicinity of ‘=0° absolutely unaffected and (b) 
leaves this solution for large times unaffected in its form (Constant 
-t#), merely changing the constant by a factor, which is, moreover, 
of order unity for any reasonable n. This weak dependence on 
the exponent is a consequence of the occurrence of (t—<x)? in 
the denominator of Eq. (10) whereby the contributions in the 
neighborhood of x~? [i.d., when R(x)/R(t)~1] are much more 
important than the subsequent contributions. 





(10) 


1H. K. Forster and N. Zuber, J. Appl. Phys. 25, 474 (1954). 

2 Riemann-Weber, Die Partiellen Differential Gleichungen der Mathemalti- 
schen Physik (Vieweg und Sohn, Braunschweig), fifth edition, pp. 117-121. 

3H, S, Carslaw and J. C. Jaeger, Conduction of Heat in Solids (Oxford 
University Press, London) first edition, p. 71. This work served as a general 
reference in the present study. 

4M. S. Plesset and S. A. Zwick, J. Appl. Phys. 23, 95 (1952). 

5 Lightfoot, Proc. London Math. Soc. (2) 31, 97 (1929). 

6 See reference 3, p. 300. 

7 See for instance, reference 3, chapter XII. 

* Let the temperature have a maximum at Ro at / =0; then a temperature 
maximum will occur at Ro+(2«t)? after time ¢; we shall call the velocity o 
this disturbance the velocity of the temperature wave. 

8 See reference 1, Eqs. 13 and 14. 

9 See reference 1, Eq. 11. 





